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Preface 


L’étude approfondie de la nature 
est la source la plus féconde 
des découvertes mathématiques. 


J.B.J. Fourier (1768-1830) 


Recent technological advances allow us to study and manipulate matter 
on the atomic scale. Thus, the traditional borders between mechanics, physics 
and chemistry seem to disappear and new applications in biology emanate. 
However, modeling matter on the atomistic scale ab initio, i.e., starting from 
the quantum level, is only possible for very small, isolated molecules. More- 
over, the study of mesoscopic properties of an elastic solid modeled by 107° 
atoms treated as point particles is still out of reach for modern computers. 
Hence, the derivation of coarse grained models from well accepted fine-scale 
models is one of the most challenging fields. A proper understanding of the 
interaction of effects on different spatial and temporal scales is of fundamental 
importance for the effective description of such structures. The central ques- 
tion arises as to which information from the small scales is needed to describe 
the large-scale effects correctly. 

Based on existing research efforts in the German mathematical community 
we proposed to the Deutsche Forschungsgemeinschaft (DFG) to strengthen 
the mathematical basis for attacking such problems. In May 1999 the DFG 
decided to establish the 


DFG Priority Program (SPP 1095) 
Analysis, Modeling and Simulation of Multiscale Problems. 


After another reviewing process the official start in September 2000 involved 
about 25 research groups all over Germany, see 


http: //www.mathematik.uni-stuttgart .de/~mehrskalen 


VI Preface 


for information about these groups and their activities. In the sequel, this 
program has inspired a number of multiscale initiatives worldwide, for exam- 
ple, the Center for Integrative Multiscale Modeling and Simulation (CIMMS) 
at CalTech, the Bath Institute for Complex Systems and the European Net- 
work Multi-scale modeling and characterization of phase transformations in 
advanced materials (MULTIMAT). 

The aim of our priority program has been to combine different expertise in 
the mathematical community and to enhance the development of mathemat- 
ical methods and concepts for the study of multiscale problems. On the one 
hand, these problems are driven by needs in applications in physics, chem- 
istry, or biology. On the other hand, the challenge to bridge theories at differ- 
ent scales leads to deep and fundamental questions within mathematics. This 
book tries to span this variety of the subject by surveying and highlighting 
the work done in the groups of the priority program. In the vast mathemati- 
cal area of multiscale problems the program has focused its efforts on several 
specific fields that can be characterized by the following subjects: 


e weak-convergence methods, relaxations, and Gamma convergence 

e Young measures, gradient Young measures, Wigner measures 

e homogenization, averaging, and adiabatic evolution 

e singular perturbations, boundary layers, interfaces, and point defects 

e formation, stability, and dynamics of sharp interfaces 

e microstructures in elastic solids, thin films or rods, and in micro-magnetism 
e derivation of continuous models from spatially discrete models 

e quantum-chemical and quantum-mechanical models 

e quantum and semi-classical models for semi-conductors and their coupling 
e numerical algorithms for microstructures in solids 

e exponential integrators for Hamiltonian systems with fast oscillations 


e almost invariant sets and metastability 


Most of the work reported in this book has been supported substantially by 
the DFG through the priority program. The major part of the funding went 
into positions for PhD students and post-doctoral researchers. In fact, this 
program has been an ideal way to educate a new generation of young scientists 
to cope with the future challenges in mathematical modeling, analysis and 
numerics. We are glad to see that about half of the contributors to this book 
are such young scientists. The additional support for travel and the guest 
program has proved to be crucial for the interaction and mutual stimulation 
of the participating groups, in particular the annual colloquia as wells as a 
large series of specialized workshops. 

Bernhard Nunner and Frank Kiefer from the DFG made our life as easy 
as possible concerning the administrative duties while pushing our scientific 
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achievements to new frontiers. In this goal, they could rely on the steady 
support and expertise of the referees Wolfgang Dahmen, Gero Friesecke, 
Thomas Y. Hou, Bernhard Kawohl, Volker Mehrmann, Umberto Mosco, 
Marek Niezgódka, Tomaš Roubíček, Stefan Sauter, Valery P. Smyshlyaev, and 
Gabriel Wittum. These referees have been valuable advisors for the projects 
in this program and played an essential role in shaping our view of the math- 
ematics for multiscale problems. 

Finally, Alexander Mielke would like to thank Stefanie Siegert. Her ded- 
ication and precise management was often hidden in the background, but 
constituted an essential part of the smooth coordination of the program. 


Of course, such a priority program on multiscale problems cannot be suc- 
cessful without the creative work of the involved researchers and without their 
openness for collaboration. We, as the initiators of this program, are grate- 
ful to all these companions in this endeavor and hope that the achievements 
within the program will prove to be rewarding for all of them. 
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Exponential Estimates in Averaging and 
Homogenisation 


Karsten Matthies 


University of Bath, Department of Mathematical Sciences, Bath, BA2 7AY, 
United Kingdom. K.Matthies@maths.bath.ac.uk 


Summary. Many partial differential equations with rapid spatial or temporal scales 
have effective descriptions which can be derived by homogenisation or averaging. 
In this article we deal with examples, where quantitative estimates of the error is 
possible for higher order homogenisation and averaging. In particular, we provide 
theorems, which allow homogenisation and averaging beyond all orders by giving ex- 
ponential estimates of appropriately averaged and homogenised descriptions. Meth- 
ods include iterated averaging transformations, optimal truncation of asymptotic 
expansions and highly regular solutions (Gevrey regularity). Prototypical examples 
are reaction-diffusion equations with heterogeneous reaction terms or rapid exter- 
nal forcing, nonlinear Schrödinger equations describing dispersion management, and 
second-order linear elliptic equations. 


1 Introduction 


Many classical multiscale problems are modelled by (partial) differential equa- 
tions which are heterogeneous in space (explicitly depending on the space 
variable x) or nonautonomous (explicitly dependent on time t). The multi- 
scale character is introduced, if these dependencies are rapid, i.e. they are 
on a fast scale. Typically, this is achieved by introducing a small parameter € 
and considering dependencies of the form «/e or t/e. Application areas, where 
such descriptions are used, include continuum mechanics, chemical reactions, 
nonlinear optics, ecology, and celestial mechanics among others. The w/e de- 
scribes e.g. the underlying varying microstructure of the medium, whereas an 
t/e dependency is used for rapid external excitations. 

Then there are two main approaches to such problems when comparing 
the heterogeneous/ nonautonomous partial differential equations with their 
homogeneous/ autonomous counterparts. Firstly one can identify effects that 
are created by the multiscale structure. Secondly one can try to find effective 
descriptions by homogeneous and autonomous equations without any explicit 
multiscale structure. This process is usually called homogenisation for space- 
dependent problems and averaging for time-dependent problems. Of course, 
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these two approaches are related, as effective descriptions can be used to 
estimate the size of effects. 

There are many different methods to derive the homogenised or averaged 
descriptions, these can be grouped roughly into two groups. The first one 
is using weak convergence methods (see e.g. [Tar79, Tar86, Bor98, GM*97, 
JKO94]), which are applicable to many problems but which usually do not 
provide a quantitative bound on the effectiveness, i.e. on the error of ap- 
proximation. The next group of methods is based on asymptotic expansions 
in the small parameter e. When ruling out purely formal methods, then the 
rigorous asymptotic techniques can be also used to derive quantitative er- 
ror bounds on the approximation error. These will be then in some finite 
or even exponential order in £. Some classical methods can be found in 
[AKN97, BLP78, BP84, JKO94, LM88]. To derive such estimates, the finer 
analysis often also requires additional assumptions on the structure of the 
differential equation. 

Besides certain regularity assumptions on the solutions, we will mainly 
consider cases where the underlying microstructure and external excitation 
is either periodic or quasiperiodic. When expanding a rigorous finite order 
procedure into a series, even in a very benign example, like periodic averaging 
of analytic ordinary differential equations 


t= f(z, t/e), 


convergence of the expansion cannot be expected in general. Nevertheless, 
beyond a finite asymptotic expansion, there are exponential estimates in sev- 
eral aspects, early examples are by Nekhoroshev, and Neishtadt; see, e.g., 
[Nek79, Nei84, LM88]. These then yield upper estimates on all kinds of effects 
created by the periodic nonautonomous structure. 

Here, the purpose of this article is to describe a number of asymptotic 
techniques which provide effective descriptions up to exponentially small er- 
rors for wide classes of multiscale problems. The remainder of the article 
has the following structure. First we will provide a list of prototypical par- 
tial differential equations and interesting solutions involving multiple scales 
in Sect. 2. Then we will describe several finite order methods like classical av- 
eraging, homogenisation and their relation to normal forms in the dynamical 
systems literature (Sect. 3). Detailed results on techniques for upper exponen- 
tial estimates for the earlier examples will be given in Sect. 4. In Sect. 5, we 
will use these results to estimate several possible effects. We conclude with a 
discussion, where we are e.g. describing some situations, where lower estimates 
are possible, Sect. 6. 


Exponential Estimates in Averaging and Homogenisation 3 
2 Examples 


We provide some typical examples for which effective descriptions beyond 
every order will be possible. For larger classes of examples and more general 
assumptions, we refer to the relevant papers [Mat01, MS03, Mat04, KMS06]. 


2.1 Partial differential equations with rapid time dependence 


The basic example of nonautonomous differential equations is the periodic 
ordinary differential equation 


ù = f(u,t/e) with f(.,7) = f(.,7 +1) foral TER (2.1) 


To describe more general dependencies than the purely periodic one, we intro- 
duce the phase ¢ € T™ = (R/Z)™,m € N. Then we rewrite the autonomous 
equation as 


ù= flug) 
$ = 20(u,6) (2.2) 


with R(u, $) > co > 0. Then m = 1, 2(u,d) = 1 recovers the periodic case. 
Results on averaging in this situation, under certain assumptions on 2, can 
be found in [BM61, SV85]. Exponential estimates are due to Neishtadt [Nei84] 
for m = 1 and to Simó [Sim94] for m > 1 and 


N(u, $) = w E€ R” with Diophantine conditions on w 
\(€,w)| > y\é|~7 for some y > 0,7 >m — 1 and all 4 € Z™ (2.3) 


Partial differential equations with such a structure can be found in the 
context of systems of reaction-diffusion equations 


us = DAu+ f(u) + gu, 4) 
b= = 2(u, 6) (2.4) 
(w(0), 6(0)) = (uo, do) € X x T™ 


with D = diag(d,,...,dn). We will consider the functiion u on [0,1] with 
periodic boundary conditions. Initial conditions are in the phase space X = 
H.,.({0, 1]¢,R”) with s > d/2 to ensure the embedding of X into C° and 
differentiability of the nonlinearities in X. Reaction diffusion equations are 
the prime example for pattern formation, for a review see [FS03]. An external 
forcing can be introduced in light sensitive reactors by periodic changes, see 
e.g. [SSW99, RM*03] for a framework and an example. 

Another example are nonlinear Schrödinger type equations, which e.g. de- 
scribe the evolution of pulses in optical fibres [NM92]. The evolution then de- 
scribes the changes of the pulse while propagating along the fibre, so changes 
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in the material due to dispersion management and localised amplification are 
described as 


$ = £2u) + glu, 9) (2.5) 


As a phase-space we use as in (2.4) some Sobolev space, denoted again by X. 
Variants describing general interaction between fast oscillations and pulses 
can be also described in the setting of Hamiltonian PDE. For the necessary 
frame work and the description of non-adiabatic coupling, we refer the reader 
to [MS083]. 


2.2 Partial differential equations with rapid space dependence 


First we consider a heterogeneous version of (2.4), where the heterogeneity 
is in the reaction term, examples of these can be found in several modelling 
areas [BHR05, Kee00, KS98]: 


ut = Ar yu + f(u, x/e) (2.6) 


We will be in particular interested in the behaviour in infinite cylinders, i.e. 
x € Rand y € X with X a bounded cross-section. In particular, let X = [0,1]? 
with periodic boundary conditions in y. When looking for stationary solutions, 
we obtain 


Ar yu + f(u, x/e) = 0. (2.7) 


We rewrite the equation and use the idea of spatial dynamics. It is a way to 
construct special solutions to PDE on unbounded domains. For this we let 


ve (£) ; A= (a, a) ; FU,2/e) = (sez) 


Renaming x as time t, we again obtain an equation 
U, = AU + F(U,t/e), (2.8) 


which has the form of a rapidly forced evolution equation. The phase space X 
is a function space on the cross-section X like X = H*t1(X7,R”) x H(X, R”). 
Even if the Cauchy problem is not well-posed, this method of spatial dynamics 
has a long history, see [Kir82] and further work (see, e.g.[IM91, AM95, FS03] 
and the references therein). 

When considering travelling waves in heterogeneous media one is using the 
ansatz 

u(x, y, t) = v(x — ct, y, x/e), 
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i.e. the profile v of the travelling wave is changing periodicly while moving 
through the periodic medium. This can be also formulated as a spatial dy- 
namics problem, for details see [MSU06]. 

Variants also include heterogeneities in the main part like in classical ho- 
mogenisation theory. The homogenisation of heterogeneous second-order el- 
liptic equations is appearing in many stationary problems, consider 


-V - (A(a/e)Vu)(x)) = f(x). (2.9) 


The matrix A € L~©(T*) on T? = (R/Z)? is assumed to be symmetric and 
uniformly elliptic. Furthermore we assume boundary conditions for x in some 
bounded domain, here again periodic boundary conditions. 


3 Finite order estimates and normal forms 


The basic idea of this approach is to transform the equation to derive an 
effective, simpler version of the differential equation. The method proved to 
be very successful in the analysis of finite dimensional dynamical systems 
[LM88, SV85]. Now we consider problems, that can be written as an evolution 
equation with external forcing like (2.4,2.8) and under further assumption also 
(2.5) (see [Mat04]). More examples and references can be found in [Mat04, 
Ver05, Ver06], including other variants of explicitly time dependent partial 
differential equations and other near-identity transformations to obtain the 
form in (3.1). We consider 


Ut = Au + f(u, @) 
$ = =u,9), (3.1) 


where u € X for some appropriate phase space X. Then a near-identity trans- 
formation on a ball Br(X) can be written in the form 


u=v+eW(v, o) (3.2) 
The transformed equation can be derived from 
Oru = Opv + EOW (v, d)Opv + cð W (v, 6) Od 
then 
pv =(I + EOW (v, ¢))*{A(v + eW(v, )) + f(v + EW (v, 4), ġ) 
- c,W(v,d) = 2, d} 


So depending on the form of 92, one can try to reduce the externally forced 
term f by an appropriate choice of W. It is notational convenient to split f 
such that 
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f(v, o) = f(v) + g(v, o) such that 
w= f sovay =o. 


In the simplest case of periodic external forcing, i.e. m = 1, 22 = 1, we let 


$ 
Woe) = | gw va, 
0 
then the transformed equation has the form 
Ov =(I + cO.W(v, $)) {Alu + EW (v, 6) + flv + eW (v, 9) 
+ 9(v + eW(v, $), $) — gv, $) }. 


This removes the lowest order nonautonomous terms. A problem is to estimate 
the remainder r, if we rewrite the equation as 


vu, = Av + f(v; £) +r(v, de). (3.3) 


The remainder involves terms depending on the unbounded operator A, such 
that r is only formally small. A more promising variant is to use some bounded 
Galerkin type approximation to perform the estimates, where the Galerkin 
approximation uy is chosen depending on e. For all our examples, there exists 
a sequence of (Galerkin) projections (Py) en which satisfy 


1. the sequence of projections converges strongly to the identity on the phase 
space X, 
lim Pyu=u in X for all u € X; (3.4) 


N> 
2. the projections Py commute with A on its domain of definition 
Py Au = APyu for all u € D(A); (3.5) 
3. the operator A is bounded on Rg Py, 
|APyulx < N|Pyulļx for all u € X. (3.6) 
The equation with projection Py on the approximation space is 
Opun = Aun + Py f (un, ¢). (3.7) 
Then the transformed equation is 
iwun = (I + côe W (un, p) H Alun + eW (un, o)) + flun + eW (un, g), 4) 
—€0,W (vn, 6) 2n, p) }. (3.8) 


Regrouping again yields 
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avy = Avy + fr(un;e€) + ri(vy, 65) (3.9) 
then the remainder term is of order 


sup €||Avy||llg(u, p), 
lon SR 
for V in a large ball Br(X). The remainder is small for an appropriate choice 
of N(e). E.g. when we are choosing N (e) such that || Avy|| ~ e—!/2, we obtain 
a rigorous estimate on rı € O(e!/?) in the approximation space Py X, this is 
uniform for € —> 0, N(e) > oo. 

The important property is now, that (3.9) has still the form of the orig- 
inal equation (3.7). So the equation can be transformed again to obtain a 
new remainder term r2. This is of order sup)... <p €l|AvnI|||71(un, %)||, such 
that rə € O(e) for the same choice of N(e) with € — 0,N(e) — co. The 
transformed equation has again the same form. Hence this procedure can be 
iterated to obtain arbitrary finite order estimate O(e*), provided we can en- 
sure enough regularity of the nonlinearity. 

In a detailed analysis taking into account all parts of the remainder term 
one can see, that the constant in the O(e¥) remainder will become large with 
k. So this procedure will not lead to a convergent asymptotic expansion. But 
keeping track of the constants in the remainder estimates depending on k and 
€ is crucial for later exponential estimates. 

Here of course, the error of the Galerkin approximation is still to be esti- 
mated. When extending the transformation back to the full space by 


u =v +EP W (Pnv, 9), 


this leads to E 
v = Av + f(v;e) + r(v, de) 


with additional terms in f and r due to the Galerkin approximation. The 
additional terms are all of the form G(v — Pyv). 

Another situation with an explicit choice of W is for m = 1 and Q(v, ¢) = 
(v) +eB(v, $). The phase ¢ will also be transformed. We choose the explicit 
change of coordinate (uy, ¢) = (un, Y) +EW (vn, Y) 


1 
W tow. Wied = (Wate gies | (3.10) 
with f 
1 
W'(vn, Wye) = Ten f g(vn,7)dr 
and 


$ 
Wun, Y; £) = an | B(uNn,T) 
+ Avy Q(un)(Wi1(un,7) — (W* (on, .)))dr, 


where (.) again denotes the T!-average. 
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The third situation is the quasiperiodic case, i.e. m > 1 and Q(v,¢) = w 
with w = (w1,...,Wm) fulfilling Diophantine conditions: there are constants 
y >0 and 7 >m-—1 such that for all £ € ZP 


(E, w) = yle (3.11) 


as for the finite dimensional result (2.3). Then the transformation is given 
terms of the Fourier expansion of the phase dependent term. Letting 


glun, se) = > ge(un;e) exp(i2n(é, ¢)) 


LEZP 


then we transform uy = vy + EW (vy, ġ; £) with 


Wun, ¢; £) = Qni(l w) 


LEZP 


exp(i2m(Z£, d)) (3.12) 


Classical homogenisation theory of (2.9) gives an asymptotic expansion 
(cf. e.g. [BP84]): 
N+2 


ua) = X eu™ nee), (3.13) 
n=0 


where the functions u® (x,y) are required to be periodic in the fast variable 
y. For this problem one can construct in this way a full asymptotic expansion 
with u% adopting the following form (see e.g. [BP84], [CS04]): 


wl (x,y) = D> > Ney) DE onl), (3.14) 


1=0 |k|=1 


where No(y) = 1 and N;(y) are periodic solutions of the main (|k| = 1) and 
higher order (|k| > 1) unit cell problems in y. The functions v,(z), s > 0, 
solve certain recurrent systems of equations in x, see [BP84, KMS06]. This 
cannot be easily rephrased as an iterative procedure but the idea of track- 
ing the dependence of the constants and taking expansion depending on € 
can also be used in the context of (2.9). For other quantitative estimates on 
homogenisation involving quasiperiodic terms see [FV01]. 

The method of iterative transformation procedures are encountered widely 
in dynamical systems theory as the concept of normal forms, see e.g. [AKN97, 
Van89]. An important question is the description of the behaviour near an 
equilibrium of 

ù = Au + f(u), u E€ R”, 


where f is of higher order in u. A particular simple form is sought for f. If 
A has purely imaginary spectrum and is semisimple (i.e. there are no Jordan 
blocks of size 2 or bigger in its complex Jordan normal form), then we obtain 
a normal form transformation by averaging, see [Van89][Sec.2.4]. When we 
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assume that exp(At) is periodic with period T, then all terms can be removed 
by appropriate coordinate changes, except those which are invariant under 
the averaging operator 


7 

wf(u) = z/ exp(— At) f(exp(At)u)dt 
0 

for all u € R. But while removing terms, which are not invariant under 7, one 

also changes certain higher order invariant terms, such that pure averaging 

will not give a correct normal form. 

The relation of averaging and purely imaginary spectrum has also been 
used in the context of wave equations and similar partial differential equations, 
see [Bam03a, Bam03b, Bam06, Kr89, Ver05, Ver06]. Here also the reductions 
to Galerkin approximations were used depending on a small parameter, which 
is introduced via scaling u +> eu to obtain 


Ou = Au + ef(u,e), 


with f(u,e) = 1/e?f(eu) € O(1). The equation is then simplified by averaging 
the semigroup exp(At). Here the name ‘Galerkin averaging’ was introduced. 
The development of ‘Exponential averaging’ in [Mat01, MS03] was indepen- 
dent of this. 


4 Exponential estimates 


We will now collect methods to move from finite order normal form and av- 
eraging transformations to exponential estimates in the framework described 
above. In particular, we obtain results to all orders O(c") and beyond. In 
the last section, we derived iterative estimates on the Galerkin approximation 
space Py X, and obtained error terms due to the Galerkin approximation. We 
will control both errors at the same time. There are three variables to choose 
to minimise the error. Firstly there is €, which is given, then we have the 
choice of the number of normal form steps and there is also the index N of 
the Galerkin approximation. 

The crucial ingredient, which dictates the optimal coupling, is the depen- 
dence of the error on N, which is in our examples a question of regularity. 
As it is a question about the decay of spatial Fourier coefficients for peri- 
odic boundary conditions or the decay of the Fourier transform for problems 
on R2. A class of function spaces is introduced to capture this. We define 
the Gevrey space in the following way: Assuming that there exists a closed, 
densely defined, boundedly invertible operator Io p with domain of definition 


Gov = D(Iy,v) C D(A), (4.1) 


such that Rg Py C Gov, To (Rg Pn) = Rg Py for all N, and 
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To „Au = ATo „u for all u € Rg Py. 
We equip the Gevrey spaces G,,, with the graph norm 
lulos p = lulx + |Tovulx. (4.2) 


For the theorem, we will assume that Gevrey-regular functions u € Go, are 
exponentially well approximated by the Galerkin projections Py, i.e. 


ITZ) (1 — Py)| < Coexp(—a/N”), (4.3) 


for N-independent constants Co(o,v). Now we are in a position to formulate 
the following theorems about exponential estimates. 


Theorem 4.1 (Exponential averaging of parabolic equation under 
periodic forcing [Mat01]). Let the nonlineairties f,g be entire functions 
on R”, let g be continuous on T! and let Q = 1 in equation (2.4). Then 
the equation can be transformed on bounded sets in X by a real analytic and 
time-periodic change of coordinates for 0 < € < £o 


u=v+eW(v,t/e;e) (4.4) 


with W bounded on any ball of radius R in X. The transformed nonau- 
tonomous terms r are exponentially small after a short transient, but the 
equation may contain nonlocal terms f: 


2 ula, = DAv(a,t) + f(v(a, t)) + ft; E(x) +r(w(t),t/e;e)(x), (4.5) 
with v(0) = uo, |v(0)|x < R and t € (0,t*) 
sup |a(v(t))|x < Ceexp(— min(t, c)e7"/%), 
|v(0)|x<R 
sup |f(V(t))|x < Ce +Cexp(— min(t, c)e~!/3) 
|v(0)|x<R 


where C',c,é9 do not depend on uo. 


Proof. For a detailed proof see [Mat01]. We will sketch the proof using the 
transformation of (3.7) in the previous section. Adapting the results of [Nei84] 
and a coupling 

N(eje* =1 


we can estimate the remainder term in the approximation space Py X. For 
this we use a complex extension and a Cauchy estimate. Performing k = 
[e~'*°] transformation steps as in (3.9) and proving that |rj4i| < |r;|/2 for 
j=1,...,k yields that the remainder term on the approximation space is 


[r| < C27" < Cexp(—ce71*%) 
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for an appropriate choice of C and c. The autonomous correction term is at 
most of order O(e). To estimate the effect of the transformation on the full 
space X, we use a regularity result for equation (2.4). Letting A = A and 


Iov = exp(—o|A|”) (4.6) 


we obtain that g i 
t,1/2 ort € 0, t* 
u(t) € eee for t > t* 


as long as |u(t)|x remains bounded. This and similar results can be found 
in [Pro91, TB*96, FT98], adaptions to equations with nonlocal operators as 
they appear in the transformed equations can be found in [Mat01]. Using this 
regularity result and (4.3), we can estimate the additional error terms due the 
Galerkin approximation of the form G(v — Pyv). Thus the overall remainder 
can be bounded by 


Irkcey x + |G(o(t) — Pryeyv(t)) |x 
Tita) + Co exp(— min(t, t*)/N (e)? 
“) + Co exp(— min(t, t*)e~°/?), 


a 


IA 


C exp(—ce~*T 


zii 








II 


C exp(—ce 


Choosing a = 2/3 yields the desired result for an appropriate choice of C and 
c. The other results including the estimate on f are direct consequences of 
the detailed analysis in [Mat01]. 


So far the analysis was about the equation, now we compare the solutions 
of (4.5) with solutions of the truncated equation, 


Ou = Aut f(u) + f(use) (4.7) 


Corollary 4.2 (Gronwall estimates with Gevrey initial data). Let the 
assumptions of Theorem 4.1 hold, and additionally assume that (2.4) has only 
globally bounded solutions. Fix R > 0, the maximal amplitude of the solution. 
Then for any to > 0 there are constants £o(to) > 0, and C” (to), c' (to) > 0 such 
that the following holds. 

Let u(t) be a solution to the truncated equation (4.7) with norm bounded 
by R in the Gevrey space Gi» 1/2, for a time interval 0 < t < to < o0. 

Then there exists a unique solution v(t) on 0 < t < to to (4.5) with initial 
value u(0). Moreover, the solutions are exponentially close in € < €o, 


l(t) = u(t)Lx < Cl exp (ce), 
for all0 < t< to. 
Proof. The difference w(t) = u(t) — u(t) satisfies the equation 


Opw = A(t)w + r(t), 
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where 


A(t) = A+ i; (Quf + Ou F\(ru+ (1—r)w)dr 


and 
r(t) = r(v(t)) — r(u(t)), 


where for the sake of notation, we omitted the arguments t and e. Since Ô, 
f , and r possess bounded derivatives on bounded sets of Gi» 1/2, the result is 
an immediate consequence of a standard Gronwall lemma. Note that Gevrey 
initial data will stay in the Gevrey space G» 1/2, such that there is no transient 
in the exponential estimate. We also note that the estimate on the remainder 
r is in the X-topology, only, such that the closeness result only holds in this 
topology. Furthermore when starting with arbitrary initial data, the transient 
in the exponential estimate would destroy an exponential estimate. 


In the next theorem, we use smooth initial data in G,.1/2, which is defined 
in the same way as in (4.1) with the same function I, as in (4.6). We denote 
a ball of radius R in a Banach space Y by Br(Y). 


Theorem 4.3 (Exponential averaging of Gevrey regular solutions of 
nonlinear Schrödinger equations [MS03]). Consider equation (2.5) with 
analytic nonlinearities C,g. Assume Q(u) > co > 0 for all u € X,¢ € T?. 
Consider initial data in Go 1/2. There exists a near-identity transformation 
I +eW, defined on the ball Br(X)xT?, which eliminates adiabatically the fast 
phase, up to an exponentially small non-adiabatic effect and which is analytic 
on Br(Go,1/2) x T*. In the new variables (v, Y), the evolution equation reads 


iðv = Av + C(v;e) + r'(v, Wye), (4.8) 
Buh = (Ov) + Muse) + 12(0, ye). 


The transformed nonlinearities r!, r2, C and Q are bounded on the ball Ba(X) 
and Br(Go,1/2) respectively, uniformly in 0 < € < £0. 

The non-adiabatic interaction terms r! and r? are exponentially small in 
E, i.e., there exist constants c,C > 0 such that 


Ir" (v, Bs e)Lx + |r? (v, vse) < Cexp (—ce“¥8), (4.9) 


for all v € Br(Go,1/2), and all Y% € T!. The adiabatic corrections C and Q 
are small in Gevrey spaces, 


ICM Nera Ce, — |A(v;e)| < Ce, 


for v € Br(Go,1/2)- 
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Proof. A detailed proof for general evolution equation or Hamiltonian partial 
differential equations can be found in [MS03]. The proof is similar to the proof 
of theorem 4.1. The formal transformations we consider are given in (3.10). 
Estimates on the remainder term in approximation space can be derived in 
the same way as above. For the analysis of the error due to the Galerkin space, 
we use that the initial data are in G,,1/2 and that (2.5) is well-posed on this 
space for finite times. Then the estimates on the non-adiabatic remainder in 
Py Br and the error of the Galerkin approximation can be balanced again to 
obtain the desired exponential estimate. 


A comparable corollary of Gronwall type also holds. 


Theorem 4.4 (Averaging of Gevrey regular solutions of parabolic 
equations under quasiperiodic forcing [Mat04]). Consider the reaction- 
diffusion equation (2.4) with analytic nonlinearities and Q(u, b) = w with Dio- 
phantine conditions (3.11). Then, for any ball of radius R in Go, there exists 
an £o > 0, such that for 0 < € < £o there exists a near identity transformation 
of both Go 1/2 and X to 





Ow = Av + flv) + gv, e) + rv, p,e) (4.10) 
Ow = Ly 


with initial conditions v(0) = uo; 0(0) = A and with g and r both bounded on 
balls in X, furthermore the remainder term is exponentially small on balls of 
the Gevrey space. 


lg(v,e)|x < Cel +1)/(T+3) 
Ir(v,0,£)|x < C(\ulg,,,) exp(—ce7 10t). 


Details, variants and extensions are given in [Mat04]. 


Proof. A major part is already the estimate on the approximation space, see 
[Sim94]. The remainder on this space is O(exp(—ce END), The error 
estimate of the Galerkin approximation is as above of order O(exp(—cN(e)'/?), 
such that the optimal coupling is N(e) = ¢~?/@+7), which yields the expo- 
nential estimate. 


Theorem 4.5 (Homogenisation via spatial dynamics [Mat05]). Con- 
sider (2.7) with analytic nonlinearity f(.,.), which is periodic in the second 
component. Then there exist co > 0, c,C > 0 and at-periodic transformation 
of (2.8) on a ball Br(X) for 0 < € < £o to 


Vi = AV + F(V) + F(V,£) +r(V,t/e,€), (4.11) 


where F,a are differentiable for V € Br(X), nonlinear and nonlocal in 
the cross-section, but local in t. When considering bounded solutions V(.) € 
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BC(R,X) of the original equation (2.8) then the influence of the fast scale 
on V(.) is exponentially small, uniformly on balls in BC(R, X): 


IIr(V(.), -/e,)llacar,x) < Cexp(—ce7/?). 
The correction term can be estimated on Br(X) by 


sup ||F(V(.),€)llacc.x) < Ce. 
V(.)EBx(R) 


Proof. A complete proof of this homogenisation and of variants can be found 
in [Mat05, MW06]. The estimates on the approximation space are similar to 
the estimates in theorem 4.1. Using again the coupling 


N(eje® = 1. 
Then we use again a regularity result. We let A as in (2.8) and use 
Tov = exp(—o|A|”) 


in the definition of the Gevrey norm in (4.1). Then we obtain that the globally 
bounded solutions are in fact highly regular as functions on the cross-section 
Q 

V(t) € Go. 1 


for some o, > 0 with estimates uniform in t. The set of all such solutions 
is sometimes called the attractor of the spatial dynamics equation. Then the 
overall remainder on bounded sets within the attractor can be bounded by 


Irk lx + IGV (t) — Prey V(t)) |x 
~1+@) 4 Co exp(—o./N(e)) 
+ Coexp(—oxe “). 


IA 


S 


C exp(—ce 





—l+a 





II 


s 


C exp(—ce 


Choosing a = 1/2 yields the result for appropriate C and c. The results hold 
in the same way for any function V(.), which is smooth enough, e.g. by being 
a solution of a similar averaged equation. 


Theorem 4.6 (Homogenisation of elliptic operators [KMS06]). Sup- 
pose A € L®(T"), f € Gorse (as defined above) and fpa f = 0 in equation 
(2.9). Let u® be the solution. Then there exist ¢-independent constants C > 0, 
c>0,K > 0, such that for any N ~ ke~ the approximation (3.13) has the 
error bound: 

jue — u ; HAM I| < Cexp(—ce“), 


The proof in [KMS06] is based on a careful analysis of the remainder term 
u* — u£, then the error can be estimated in an exponential way for k = N ~ 
ke ', where only regularity of f is needed, but not on the regularity of the 
matrix A. 
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5 Effects 


In this section we discuss some effects due to the heterogeneous or nonau- 
tonomous structure of the equation. 


5.1 Splitting of homoclinic orbits 


An important difference in the dynamics between autonomous and non- 
autonomous differential equation are the existence of transversal homoclinic 
orbits. So consider a parameter dependent version of (2.4) without rapid forc- 
ing 


bru = Au + f(u, A) (5.1) 


with A € R. A homoclinic orbit is a solution, which is biasymptotic for 
t — +0 to some hyperbolic equilibrium uo, i.e. Auo + f(uo, A) = 0 and 
the operator A+ Df (uo, A) has only spectrum away from the imaginary axis. 
Under some non-degeneracy conditions, this homoclinic orbit will only exist 
for special values Ap. Whereas in the externally forced equation 





Ou = Au + f(u, A) + g(u,t/e) (5.2) 


the stable and unstable manifold will intersect transversally, creating rich dy- 
namics nearby. The main assertion of theorem 4.1 is, that the nonautonomous 
dynamics of (5.2) can be described by the exponentially close autonomous 
equation as in (4.7). It is possible to show by some further analysis that 
the equilibrium persists as a hyperbolic periodic orbit, exponentially close to 
an equilibrium of the truncated equation after the transformation of theo- 
rem 4.1. Then also the phase-portraits with unstable and parts of the stable 
manifolds will be exponentially close, see [Mat01]. From this, it is easy to 
see, that transversality effects can only occur in a small parameter interval 
(AT (e), A+ (e)) with 


|A* (€) =A (©)| S$ C expe), 


such that these effects were called “invisible chaos” [F596]. 

Using functional analytic methods as in [Mat03a] one obtains better results 
for large classes of parabolic equations, by analysing the problem in complex 
time. A more detailed analysis is possible for ordinary differential equations 
including lower estimates, see e.g. [Gel99, HMS88}. 


5.2 Pinning 


Pinning describes a phenomenon in equations like (2.6), where a travelling 
wave does not propagate due to heterogeneous structures in the medium. 
Thus travelling waves will be instead standing waves, i.e. they are solutions 
o (2.7). A pinned wave is then homoclinic or heteroclinic orbit in the spatial 
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dynamics setting (2.8). Pinning occurs when there a transversal intersection 
of stable and unstable manifolds. The general idea is similar to the analysis of 
the transversal intersection above. Here it is more convenient to find solutions 
nearby a pulse Uo as zeros of 





I(V) = &(Uy + V) — A(Uo + V) — F(Up + V) € BC(R, X) 


Using Fredholm properties or exponential dichotomies, the problem is reduced 
to a low-dimensional problem, where the effects of the heterogeneous terms 
can be estimated to be of order O(exp(—ce~!/)), see [MW06]. 


6 Discussion 


In this article we have given an overview on results about averaging and ho- 
mogenisation for partial differential equations beyond every order. The first 
step of this analysis are finite-order averaging estimates, which were iterated, 
to obtain exponential estimates on the transformed equation. With Gronwall- 
type estimates, we can extend this to estimates about the finite time be- 
haviour. For particular solutions, equilibria, stable and unstable manifolds 
and connecting orbits the analysis can be extended to infinite times too. Due 
to the general nature of the procedure there remain several problem, which 
require a finer analysis. 

The construction of the transformation and the remainder terms are it- 
erative so they are not easily computable, but they coincide to finite order 
with their finite order counter-parts. Therefore, it is possible to show by direct 
calculations, that the transformed nonlinearities are typically not-local in x, 
even if the original nonlinearities are local, see [Mat04]. 

A very subtle point are lower estimates for both averaging procedures and 
particular effects. For the general averaging procedure, there is an example in 
[MS03], showing that the averaging results cannot be improved by any other 
“averaging-type” transformation. In particular there is difference between or- 
dinary differential equations and infinite dimensional systems in what kind of 
exponents can be achieved by exponential averaging. In [KMS06], a particu- 
lar simple choice of the heterogeneous matrix allows explicit calculations and 
shows, that the truncation of the asymptotic expansion is optimal. 

For particular effects like the splitting of homoclinic orbits, there are 
several ways to obtain lower estimates in the finite dimensional case, see 
[(HMS88, Gel99]. A crucial part is extending the analysis to complex time and 
analysing the time-singularities in the complex plane. This idea is also used 
in other cases of exponential analysis [BT05, CM05] and references therein. 
Some ways of introducing exponential asymptotics into numerical analysis 
can be found in [MBS00, Mat03b]. The relation of exponential averaging and 
integrable systems is still open, but see [Bam99, P6s99] for related results. 

The analysis here was restricted to cases to some partial differential equa- 
tions with periodic boundary conditions, the needed abstract properties, such 
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that the theorems hold for large classes of evolution equations are given e.g. 
in [MS03, Mat04]. Then these results will hold e.g. for problems on the do- 
main R2. The effect of boundary conditions and of boundary layers [Neu00] 
on exponential homogenisation still remains to be analysed. 
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Summary. Our objective is to describe solidification phenomena in alloy systems. 
In the classical approach, balance equations in the phases are coupled to conditions 
on the phase boundaries which are modelled as moving hypersurfaces. The Gibbs- 
Thomson condition ensures that the evolution is consistent with thermodynamics. 
We present a derivation of that condition by defining the motion via a localized 
gradient flow of the entropy. Another general framework for modelling solidification 
of alloys with multiple phases and components is based on the phase field approach. 
The phase boundary motion is then given by a system of Allen-Cahn type equations 
for order parameters. In the sharp interface limit, i.e., if the smallest length scale 6 
related to the thickness of the diffuse phase boundaries converges to zero, a model 
with moving boundaries is recovered. In the case of two phases it can even be shown 
that the approximation of the sharp interface model by the phase field model is of 
second order in 6. Nowadays it is not possible to simulate the microstructure evolu- 
tion in a whole workpiece. We present a two-scale model derived by homogenization 
methods including a mathematical justification by an estimate of the model error. 


1 Introduction 


Solidification of alloys based on iron, aluminum, copper, zinc, nickel, and 
other materials which are of importance in industrial applications involves the 
occurrence of structures on an intermediate length scale of some um between 
the atomic scale of the crystal lattice and the typical size of the workpiece. 
This so-called microstructure consists of regions (in the following labelled 
phases) differing in the crystalline structure, in the composition or only in the 
orientation of the crystal lattice, and it is responsible for a broad range of 
material properties and, hence, for the quality and durability of the material. 

Being a result of the solidification process the microstructure is not in 
thermodynamic equilibrium. Its formation is classically modelled using mov- 
ing hypersurfaces for the phase boundaries. The Gibbs-Thomson condition 
couples the form and the motion of the interface to its surface energy and 
to the local thermodynamic potentials of the adjacent phases. In addition, 
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balance equations for the internal energy and the concentrations of the com- 
ponents have to be taken into account. This leads to diffusion equations in 
the phases and jump conditions on the moving phase boundaries. 

In the last years, the phase field approach has emerged as a powerful tool 
to simulate microstructure formation. Phase field variables are introduced 
standing for the presence of related phases. Instead of jumping across the 
phase boundaries, the phase field variables and all the thermodynamic quan- 
tities change smoothly but rapidly within a narrow transition layer. It scales 
with a new length scale 6 smaller than the typical scale of the microstructure 
to be described. This leads to the notion of a diffuse interface in contrast to the 
sharp interface model with the moving phase boundaries. The Gibbs-Thomson 
condition is replaced by a diffuse version which can be viewed as a gradient 
flow of an appropriate Ginzburg-Landau energy. The balance equations for 
the conserved quantities are reformulated in terms of the new variables where 
the jump conditions enter in a natural way. As a main advantage, numeri- 
cally simulating microstructure formation is restricted to solving a system of 
parabolic differential equations, and explicit tracking of the phase boundaries 
in the sharp interface model is avoided. 

The limit of vanishing diffuse interface thickness, i.e., the limit as ôN 0, 
is of particular interest. The first question is whether a related sharp interface 
model is obtained in the following sense: given solutions to the diffuse interface 
model for every 6, is there a limit of the solutions, and which equations do 
the limiting fields fulfill? This question is related to the calibrations problem 
when quantitatively investigating a certain alloy. Usually, material parame- 
ters such as latent heats, surface tensions, and several mobility and diffusion 
coefficients entering the sharp interface model are measured in experiments, 
and the question is how they should enter the diffuse interface model. 

Problems involving multiple length scales not only result from the mod- 
elling approach but are also inherent in the physical problems itself. Diffusion 
of the temperature is much faster than mass diffusion. Because of the bound- 
ary conditions — solidifying workpieces are usually cooled from outside — and 
the release of latent heat the temperature field is expected to suffer changes 
over a scale proportional to the size of the the workpiece. On the other hand, 
the concentrations of the components should exhibit strong gradients only 
near the solidification front. The available numerical techniques and compu- 
tational power only allow for the simulation of small domains in acceptable 
computation time, the direct computation of the microstructure of a whole 
workpiece is not feasible. For the latter purpose, macroscopic models involv- 
ing heuristic assumptions on the distribution of the solidified parts and the 
released latent heat have been in use. Newer mathematical methods are based 
on a two-scale approach and allow for effective, homogenized equations for 
the temperature distribution but also for taking the microstructure evolution 
into account. 

The structure of the present article is as follows. The first section is dedi- 
cated to models for alloy solidification. First, the governing equations from the 
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classical approach for modelling alloy solidification are presented. In particu- 
lar, the Gibbs-Thomson condition is derived by locally varying the entropy. 
After, the phase field approach is presented. In the second section, the rela- 
tion between the sharp and the diffuse approaches is elucidated. Comments 
on the calibration problem are given including appropriate potentials for the 
phase field model with good calibration properties. Exemplary, a model for 
a binary alloy is derived. In the third section, a mathematically rigorous ap- 
proach to the derivation of homogenized models for phase transitions with 
equiaxed dendritic microstructure is given. Asymptotic expansions are used 
to derive a macroscopic heat equation coupled to microscopic cell problems 
for the dendritic growth. A mathematical justification is carried out, i.e., an 
estimate is established comparing the solution to the two-scale model with 
that to the original model. 


Acknowlegdments. This work has been supported by the German Research 
Foundation (DFG) through the Priority Program 1095 “Analysis, Modeling 
and Simulation of Multiscale Problems”. We thank Britta Nestler and Frank 
Wendler for the inspiring discussions, mainly emerging from numerical simu- 
lations based on the presented phase field model (see also their contribution 
on page 113). 


2 Models for alloy solidification 


The production of certain microstructural morphologies is often achieved by 
imposing appropriate conditions before and during the solidification process. 
In order to get a deeper understanding of the process, the scientific challenge 
is to describe the microstructure formation with a mathematical model where 
the imposed conditions enter the equations governing the evolution as initial 
and boundary values or as additional forces and parameters. 

A framework for continuum modelling of alloy solidification can be derived 
from thermodynamic principles for irreversible processes (cf. [Mu01]). Balanc- 
ing the conserved quantities energy and mass respectively concentrations of 
the components yields diffusion equations in the bulk phases as well as con- 
tinuity and jump conditions on the moving phase boundaries. A coupling of 
the phase boundary motion to the thermodynamic quantities of the adjacent 
phases, the Gibbs-Thomson condition, is derived by localizing an appropriate 
gradient flow of the entropy. The balance equations and the Gibbs-Thomson 
condition, together with certain angle conditions in junctions where several 
phases meet and which are due to local force balance, enable to show that the 
local entropy production is non-negative and to prove an entropy inequality. 

An entropy functional involving bulk and surface contributions plays a cen- 
tral role also in non-equilibrium thermodynamics. In the phase field approach, 
the interfacial entropy (or energy) is modelled with the help of a Ginzburg- 
Landau type functional. Evolution equations for the phase fields can then be 
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derived as gradient flows (see [FP90]) or within the theory of rational thermo- 
dynamics (see [AP96, Ha06]). A small length scale is involved which is related 
to the thickness of the interfacial layers. 

We proceed as follows. First, the classical approach to model alloy solid- 
ification, namely with moving phase boundaries, is presented. In the second 
subsection, the Gibbs-Thomson condition is derived. After, the phase field 
variables are defined, and the phase field approach is presented. As an ex- 
ample, a model for non-isothermal solidification of a binary alloy involving 
two phases is derived. Finally we briefly comment on the solvability of the 
differential equations of the phase field model. 

For general informations on the theory and models of phase transitions 
we refer to the books [BS96, Vi96]. In this section, partial derivatives some- 
times are denoted by subscripts after a comma. For example, $e is the partial 
derivative of the function s = s(e,é) with respect to the variable e. 


2.1 Classical approach with moving hypersurfaces 


An alloy of N € N components occupying an open domain 2 € R?, d = 1,2,3, 
during some time interval Iy = (0,7) is considered. Changes in volume or 
pressure are neglected (cf. [Ha94], Sect. 5.1). Moreover, the mass density is 
assumed to be constant (only concentrations will be considered). The only 
transport mechanism is diffusion, and there are no forces present leading to 
flows or deformations. Such effects can strongly influence the growing struc- 
tures (cf. [Da01]). The applicability of the model presented in the following is 
therefore restricted to cases where such effects can be neglected. 

Let M € N be the number of possible phases. The domain 2 is decomposed 
into sub-domains 12;(t),..., Qm (t), t € Iz, which are called phases. The phase 
boundaries Iyg(t) := Qa(t) N Qe(t), 1 < a # B < M, are supposed to be 
piecewise smooth evolving points, curves, or hypersurfaces, depending on the 
dimension (cf. Def. A.1 in the Appendix). The unit normal on Ig pointing 
into phase Ng is denoted by vag. If d > 2 the intersections of the curves or 
hypersurfaces are denoted by Tags(t) := Nalt) N Qg(t) N Qs(t) for pairwise 
different a, 3,6 € {1,..., M}, and the points where the phase boundaries hits 
the external boundary by Tag,ert(t) := Qa(t)N a(t) NOM. If d = 2 then Tags 
is a set of triple junctions, i.e., piecewise smooth evolving points. If d = 3 
triple lines can appear which are piecewise smooth evolving curves. 

During evolution, it may happen that one of the phases vanishes, namely 
if the adjoining phase boundaries coalesce. It is also possible that a piece of 
a phase boundary vanishes so that one of the sets Tags includes a quadruple 
point or line. Typically, the latter configuration is not in mechanical equi- 
librium and will instantaneously split up forming new phase boundaries (see 
[GNS99, BGN06]). It is supposed that such singularities only occur at finitely 
many times t € Iy during the evolution. This is why only piecewise smooth 
evolution is assumed. The evolution equations stated in the following are only 
valid for times at which no such singularity occurs. 
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Before proceeding let us introduce some notation. For K € N define the 
sets 


K 
HE. fo ERK: Dus ih, 5K = fv CHEK : v; >20 vit. (2.1) 
i=1 
The tangent space on HY’* can be naturally identified in every point v € HY* 
with the subspace 


K 
T,HSK & TSK := {w ERX: So w= o}. (2.2) 
i=l 


The map PX : R* — TX“ is the orthogonal projection given by 


DELAK T. 
ENA = eel ) 
P*w (we oe) wa ( K-K K®@1K)w 


where 1x = (1,...,1) € RË and Idx is the identity on R*. 
The following bulk fields are considered in the phases Ra, 1<a< M: 


c7 : concentration of component i, 1 <i < N, 

co := e° : internal energy density, 
f~ : (Helmholtz) free energy density, 
ps : chemical potential of component i, 1 <i < N, 
T@ : temperature, 

s“ : entropy density, 


Up := F&a : inverse negative temperature, 


Us = — : reduced chemical potential difference of component i, 1 <i < N. 





On the interfaces Tag, 1 < a # P < M, there are the following surface fields: 


Vap : unit normal pointing into Qg, 
Tag(Vap) : surface tension, 
Yoo(Vag) : capillarity coefficient, 
Map(Vas) : mobility coefficient, 

Veg : normal velocity towards vag, 


Kag : Curvature. 


The concentrations fulfill the constraint ¢* = (c%,...,¢%,) € XN. Follow- 
ing [Mu01], Sect. 11.2, the evolution in the phases is governed by balance 
equations for the conserved quantities, i.e., 
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N 
nef =-V-JIZ=V-(SOLEVue], O<i<N. (2.3) 
j=0 
Let us briefly comment on the fluxes J. In thermodynamics of irreversible 
processes the relations between the fields are based on the principle of local 
thermodynamic equilibrium. In the present situation the entropy density s“ 
is a function of the conserved quantities. Its derivatives are the inverse tem- 
perature and the chemical potential difference reduced by the temperature, 
i.e., 





1 a“ 
s“ = s*(e*,é%) and ds“ = gate" + Ta . dê” = —u®-c™. 
In the above equation the identity p% = PNw* was used, where u = 
(ut,...,u%)". The fluxes are postulated to be linear combinations of the 


thermodynamic forces Vu, 0 < j < N, with coefficients Li; which may 
depend on the thermodynamic potentials u% or on the conserved quantities 
c7. This phenomenological theory was already introduced in [On31]. It is as- 
sumed that the matrix L = (Le) =o is positive semi-definite. To fulfill the 
constraint ĉ% € XN it is required that pale Le, = 0,1 <7 < N, which also 
means that Y; JE =0. 

Onsager’s law of reciprocity states the symmetry of L and can be proven 
and experimentally observed if the fluxes and forces are independent (cf. 
[KY87], Sect. 3.8). The above fluxes are not independent. But even in the 
present case Onsager’s law can be shown to hold by a certain choice of the 
coefficients (see [KY87], Sect. 4.2, and the reference therein; there the calcula- 
tion is performed for the isothermal case, but another additional independent 
force can be taken into account without any problem). We remark that, con- 
sidering J; — Jy, the definition of the fluxes as above is equivalent to the 
definition in [Mu01], Sect. 11.2. 

On the phase boundaries Tag the continuity conditions 


[uJ2 =0, O<i<QN, (2.4) 


have to be satisfied. Mass and energy balance imply furthermore the jump 
conditions 
[cil2vap = [ie vas, O<E<N. (2.5) 


Here, [-]2 denotes the jump of the quantity in brackets across Tag, e-g., [e] = 


The matrix of surface tensions (ds(V))a,g is symmetric for every unit 
vector v (the diagonal entries are not of interest and may be set to zero). The 
relation between surface tension and capillarity coefficient is given by 


Tap(Vas) 


Yop (Vag) = Tref 
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with some reference temperature Tef. The surface tensions are one-homo- 
geneous in their argument while the mobility coefficients mag(Vag) are zero- 
homogeneous in their argument. 

The evolution of the phase boundaries is coupled to the thermodynamic 
fields by the Gibbs-Thomson condition 


N 
N B 
Mop(Vas)Vas = —Vr : Dyap(Vvas) + |- uo f(T, ĉ) + Lual (2.6) 
i=1 


which is derived in the following subsection. By Vpr -+ the surface divergence 
is denoted. In the case of an isotropic surface entropy, i.e., Yaa(V) = YoglY| 
with some constant Y,, independent of the direction, there is the identity 
-Vr : Dyap(v) = FagKap Where Kag is the mean curvature. 

To avoid wetting effects (cf. [Ha94], Sect. 3.4, for a discussion and refer- 
ences) the surface tensions are assumed to fulfill the constraints 


Cap + OBS > Cas. (2.7) 
Capillary forces acting on Tag are related to the vectors (cf. [CH74, WM97]) 


£a8(Vae) = Doas(Yas) = Cap (Vas Wag + Dga-10ap(Vas) (2.8) 


where Dga-1 is the surface gradient on the sphere S¢~!. The identity D = 
Dga-1ı +Vap(Vag: D) was used as well as the fact that cag is one-homogeneous 
implying Doag(Yag) ` Vap = Fae (Vag): 

In points x belonging to Tags forces are in equilibrium. In the three- 
dimensional case Tags consists of triple lines that can be oriented with a unit 
tangent vector Tags(x). If the whole space is cut with the plane orthogonal 
to Tags(a) through x then the picture in Fig. 2.1 is obtained. Due to the sur- 
face tension Tag exerts a force on z which is given by £ag(Vag(X)) xX Taes(2), 
whence equilibrium of forces means that 


O= So fj (viz(x)) x Taps(x) (2.9) 
(i j)EA 


where A := {(a, 3), (8,8), (6,a)}. A short calculation shows that in the situ- 
ation of Fig. 2.1 


Eag (Vap) X Taps = (Voap (Vag) ` Tap)(—Vap) + Cap (Vas) Tap- 


Similarly, if x € Tag,ext there is a unit tangent vector Tog,ext(x), and the force 
acting on z is given by £a¢(Vag(Z)) X Tag, ext (£). Force balance in x implies that 
this force is not tangential to O92. Since it is already orthogonal to Tag,ext(£) 
by definition this is true if and only if 


Eas (Vas (x)) f Vext(£) =0. (2.10) 
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Vv 


õa G 


Fig. 2.1. On the left: triple junction x with orientations of the forming curves; 
such a picture is also obtained in the 3D-case by cutting the space with the plane 
spanned by vap (£), Tag (x). On the right: local situation around a point xo on a phase 
boundary for the derivation of the Gibbs-Thomson condition; a local deformation is 
indicated by the dashed line. 


In particular, angle conditions in Tags and Tag,ext are due to the above force 
balance conditions (2.9) and (2.10). 

To obtain a well-posed problem the governing equations (2.3)—(2.6), (2.9), 
and (2.10) must be provided with initial conditions for the fields and the mov- 
ing boundaries and boundary conditions. If not otherwise stated, the isolated 
case 

JE Vege =00n ON, O0<i<N,1<a<M, (2.11) 





is considered. 
The total entropy of the system being given by 


M 
S(t) = 2 Í geet 


it can be shown that the evolution equations (2.3)-(2.11) imply non-negative 
entropy production: 


M 


f. rawan (2.12) 
a<ß,a,ß=1 Tap(t) 


Theorem 2.1. The entropy (2.12) satisfies 


d N 
— S(t = Vu? . LA Vue dle 
gO = Sf Vat Lge 


1<a<M i,j=0 


+ 5 J Mag(Vag)” dH?-}. 
Tap (t) 


1<a<B<M 


The proof can be found in the Appendix of [GNS04]. 
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2.2 Derivation of the Gibbs-Thomson condition 


In this section a physical motivation of the Gibbs-Thomson condition (2.6) 
based on thermodynamic principles is given. The idea is to define the mo- 
tion of the phase boundaries as a gradient flow of the entropy. On the set of 
admissible surfaces the tangent space of a surface is defined by the smooth 
real valued functions f on the surface supplied with a weighted L?-product. 
A variation of the surface entropy in the direction f is then the rate of change 
of the entropy when deforming the surface towards its normal with a strength 
given by f. Such a deformation of a phase boundary usually changes the vol- 
umes of the adjacent phases. Thanks to this fact the bulk fields can enter the 
Gibbs-Thomson condition. But changes in the conserved quantities must be 
counterbalanced. Since (2.6) is a local motion law, only local deformations of 
an 7-ball around a point xo on a phase boundary are considered. Conservation 
of energy and mass is ensured by taking a non-local Lagrange multiplier into 
account. In the limit as 7 — 0 all terms become local after appropriate scaling 
so that the desired equation is obtained. 

For keeping the presentation simple we do not consider the general situa- 
tion as in the previous subsection but the one depicted in Fig. 2.1. There, I 
is a smooth compactly embedded d — 1-dimensional hypersurface separating 
two phases 2* and Q- with unit normal v pointing into NT. Such a surface 
respectively configuration is said to be admissible. 


Definition 2.2. Let G be the set of the admissible surfaces. The tangent space 
is defined by TrG := C! (T, R). A Riemannian structure on TrG is defined by 
the weighted L? product 


eee = | moyea Vu E ETrG 


where m(v) is a non-negative mobility function. 


The bulk fields for energy density and concentrations, here denoted by œ, 
are allowed to suffer jump discontinuities across I’, but the potentials s e = —u 
are supposed to be Lipschitz continuous. Within the phases Q* and Q27 all 
fields are smooth. 

Variations of the entropy are based on local deformations of the domain. 
Let xo € I and consider the family of open balls {U” }y>o centered in zo with 
radius ņ. Given arbitrary functions €” € Cj(U”) it can be shown that that 
there are vector fields 


E” c CE(U”, R®) with €" = €"v on I” := PAU”. (2.13) 
The solution 6” : U” — U” to 


00, y) =y, (5, y) = E" (0"(—8, y)) for 8 € [-80, ðo], 
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yields a local deformation of U”. The restriction of 6 is such that T” := UNL 
remains a smooth surface imbedded into U”, i.e., the sets 


ry = {0"(6, x) TE PIY, ô € [—5¢, ôo]; 


define an evolving (d — 1)-dimensional surface in U” in the sense of Def. A.1. 
A short calculation yields the identity 


1 det 6”, (5,2) = V - €” (0"(8, x)) det 6”, (6, x). (2.14) 


The functional mapping L!-functions on U” onto their mean value is denoted 
by M”, i.e., 


M” : LU") >R”, M"(f) = ga, fede = | fod 


where |U"| = L4(U") with the d-dimensional Lebesgue measure C4. 


Definition 2.3. Under the local deformation 0” of U” the densities of the 
conserved quantities are 


(5, y) := €°(0"(—6, y)) — M"(°(0"(—6, ‘y= o°(-)), y € U". (2.15) 


The local entropy consists of the bulk part 


Si (6) = f s(c(6, y)) dy (2.16) 


and the surface part 


TORE J YO) H=. (2.17) 


ô 


The Lagrange multiplier M"(c°(0"(—6, -)) — c°(-)) in (2.15) ensures that en- 
ergy and mass are conserved under the deformation. 


Lemma 2.4. The derivative of the bulk entropy with respect to 6 in 6 = 0 is 


Z s30) = 2 (8) +M): 2) 7-6 de. 


Proof. By definition (2.15), the bulk entropy (2.16) is 
f (EOE) -MUP )) =) dy 
Un 
= i 5 (69 (ar) — M(c(8%(—6,-)) — €°)) det 8,2 (8, 2) dx 
Un 
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where the transformation y = 6"(6,7) was used. The equation (2.14) yields 
together with 0”(0, 2) = x and det(6”,(0,x)) = det Id = 1 





d 079n d 0 
nee af = — t0” 
5 Jp ECCO Dda = ae p, CO0) dal, 
= / e (x)V - E(x) dr 
Un 
With s e = —u the desired identity can be shown as follows: 


Z szo = In - (x) — M” (2 (0"(0,-)) - °))V-8%(2) A 
l n 
pets (a) p JU] e (0"(—(:), z)) dz r dx 


=f se ))V - €"(x) dx + mer dae fF O(x)V - E(x) dx 


=f a 9) 4 M”(u) -c 0) V. g'(x) dx 
a 





Lemma 2.5. The derivative of the surface entropy with respect to 6 in 6 = 0 
is 
d 


af n d— 1 
z930 ==]. Vr- Dy(v) € dH 


Proof. Interpreting {I }s as evolving surface, the scalar normal velocity is 
€"” and the vectorial normal velocity is é” = "v. The scalar curvature is 
denoted by kr. Applying Th. A.4 from the Appendix yields (observe that 
the boundary integrals over OJ" vanish since the velocity €” has a compact 
support in U”) 


30) == | (PI -I)E wr)aH 
dô pn 
which is using (A.3), (A.2), (A.4), and the one-homogeneity of y 
z ik (VAE) Vre? + Vv) veep "GN, 
rn 


Applying Th. A.3 to y = Vy(v)E" (again the boundary integral vanishes) and 
again (A.2) on the last term it follows the desired identity: 


n= I (= Vr: VIV) E = nr: Vy) E + Vy): rr €") dH 
pn 


=— i (Vr-Vy(v) é")dHe. 
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As stated at the beginning of this section, the goal is to define the motion 
as a localized version of a gradient flow. This is realized in the following 
definition. Let |I""| := H4-1(1""). 


Definition 2.6. The motion of the phase boundary I’ is defined as follows: 
In each point xo € I the identity 


1 d 
wa fay £N) — Ji gt n 
holds for all families of functions €” € Cj(U") where SR(8) and S3(5) are 
defined by (2.16) and (2.17) respectively. 


Theorem 2.7. The localized gradient flow (2.18) yields the Gibbs-Thomson 
condition (2.6). 


To prove the theorem the following lemma is useful: 


Lemma 2.8. Let g € L®(U”) with g € C1(Q¢+ NU") and g € CHR- NU"), 
and let z € R be given. There is a family of functions {E"}ys0 C C1(U") with 
E (xo) = z for all ņ such that 


1 1 
— V-é"d --f tenant -i f Vg-€"d 
Faf,’ E” da 9 I! £ | Jim Y9 E” da 


> —[9(a)|t2 as 40 


where the functions €" are uniformly bounded and satisfy condition (2.13). By 
g% the limit of g in x € I when approximated from the side QF is denoted. 
Analogously g~ is defined when approximating x € I from Q-, and |g]} = 
g' —g is the difference. 





Proof. The first identity follows from the divergence theorem applied to the 
two parts U"N QF and U” N N7 of U” using that £” vanishes on the external 
boundary OU”. For the limiting behavior consider the functions 


0 on UU, 


Let ¢ be a smooth function with compact support on the unit ball U+ (0) c R? 
such that fga = 1 and define €” by the convolution of €” with n~3¢¢(-/73), 


i.e., N 
E(x) := (M?) * E) (a). 


Then for 7 small enough £” = z on I N U2? =: f”, 

Observe that thanks to the smoothness of I the H¢—!-measure of ["\I"" 
is of order n? whence |I"\I"|/|'"| = O(n) as n — 0. By assumption, the 
function f = [g]* is Lipschitz continuous on I’. Thanks to the special choice 
of €” it can easily be derived that 


Multiscale Problems in Solidification Processes 33 


fE AHT! = -F fzdH?* +4 f(é"—z)dH*" = f(ao)z 

cn cn rn 

as 7 — 0. As moreover the £7-measure of U” is of order n? but the H41- 
measure of I” is of order 77~1 and since |Vg - €"| is bounded in U” the 
assertion follows. 


Proof. (Th. 2.7) First, observe that M"(u) — u(ao) as 7 — 0 since u is 
Lipschitz continuous. Choose some arbitrary z € R and a family of functions 
{€7}y>0 as in Lemma 2.8 and let {€"},,50 be the corresponding vector fields. 
Then 


1 


2 nu) -L(x - E" (x) dx = (u 
Tad ee (x)V + €"(x) dx = M"(u) 


T L (x)V - E(x) dx 


— u(x): [e°(ao)| tz = [u - Ot (zo)z. 


The limit of the right hand side of (2.18) is, using the Lemmata 2.4, 2.5, and 
2.8, 





Tg 88 + SHO 
= Ta a (s(c°) + M'u) - P) V - "dx — Í, Yr: Vy(v) dH? 
= (beite + [F] eo) + [A] eo) Yr veeo) ) « 
= [2] ee —Vr: Voie) Zz. 
For the last two lines the identities c° = (e°, 2), uo = —#, (ui,-..,un)” = $, 


and the thermodynamic relation e = f + sT were applied. The left hand side 
of (2.18) yields in the limit as 7 — 0 


rage = f mey an= — m(v(a0) o(a0)2 


Since z € R can be chosen arbitrarily the condition (2.6) follows in xo. 


2.3 Phase field approach 


In phase field models, the individual phases are distinguished by phase field 
variables. In different phases they attain different values, and interfaces are 
modelled by a diffuse interface layer, i.e., the phase fields and all other ther- 
modynamic quantities change smoothly on a thin transition layer (the diffuse 
interface) instead of suffering discontinuous transitions. 

Let ¢ = (fa)? where each variable a describes the local fraction of a 
corresponding phase a. The vector of these phase field variables is required to 
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fulfill the constraint ¢ € XM. The interfacial contribution in (2.12) is replaced 
by a Ginzburg-Landau type functional (cf. [LG50]) of the form 


-[ (alvo + 5002) dz. (2.19) 


The function a: XM x (TZM)? — R is a gradient energy density which is 
assumed to be smooth and to satisfy 


a(¢, X) > 0 and a(ġ, nX) = nalo, X) Y(¢, X, n) € 2M x (TXY)? x RH. 


The function w : XM — R is smooth and has exactly M global minima at 
the points eg = (dag)a4,, 1 < 8 < M, with w(eg) = 0, i.e., 


w(ġ)> 0, and w(¢) = 0 & ¢ = eg for some 8 € {1,..., M}. 


Possible choices for a and w will be given later. We also refer to the article of 
Nestler and Wendler on page 113. 

The surface contribution to the entropy is described above. Let us now 
comment on the bulk entropy contribution and its dependence on the phase 
field variables. The (Helmholtz) free energy of the system can be defined as 
an appropriate interpolation of the free energies {f°(T,é)}q of the possible 
phases, i.e., 


M 
f(T,2,¢) = Lf (2.20) 


with an interpolation function A : [0,1] — [0, 1] satisfying h(0) = 0 and A(1) = 
1. By the thermodynamic relations s = — f,r and e = f + Ts the entropy and 
the internal energy can be expressed in terms of (T,é,¢). By appropriate 
assumptions on f, inversely, the temperature can be expressed as a function 
in (e, ĉ, $) = (c, $) whence also the entropy, s(c, ¢) = —f,r(T(c, ¢), ĉ, ¢). Short 
calculations taking the change of variables into account yield 


fo(T(69).04) 


salap = —ule, 9), 8,9(69) = —* 


The total entropy of the system is now 


Sted) = f (se.4) (alo, Vo) + 5u(6))) ae 


The evolution of the system is determined by a gradient flow of the entropy 
for the phase field variables coupled to balance equations for the conserved 
variables such that the second law of thermodynamics is fulfilled. To allow 
for anisotropy in the mobility of the phase boundaries, again a weighted L?- 
product is used. Given a smooth field ¢: Q— XM let 


(w, V)w,6 =| wl, Vd) w-vde Yw,v € C®(2;TX™). 
Q 
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The function w is supposed to be smooth, positive, and homogeneous of degree 
zero in the second variable, i.e., 


w(¢,X) >0 and w(ġ, nX) =w, X) V(b, X, n) € D” x ROX x RT. 
The evolution of the system is defined by 


(Obv) = (Fleg) u) Wo E C2, T3), 


Taking the boundary condition 
a Vhal, VP) “Vert =0, 1<a<M, (2.21) 


into account this means that for all a € {1,..., M} 


ôw(ġ, V0) be = 6V: a Voa (¢, Vo) = ÔA ba (¢, Vo) = FY bn (¢) + Spa (c, o) =X 


(2.22) 
with the Lagrange factor (due to the constraint Xa ¢a = 1) 
M 
A= = J+ (5V -avoa ($, VO) — 60,5, (9, V4) — 20,4, (8) + 8,54(6.9) 
M a Voa ry Pa , 5 Pa Qa O; : 


It is also possible to consider multi-well potentials of obstacle type (cf. [BE91}). 
Then the differential equation (2.22) becomes a variational inequality. 
The balance equations for the conserved quantities read 


N 
Ac; = -V- Jile, p, Vulc, d)) =V. (> Lij(c, 6) Vuj(c, o) . (2.23) 

j=0 
The fact that the Onsager coefficients L;;(c,@) can differ in the different 
phases may be modelled by interpolating the coefficients {L}; }a of the pure 
phases analogously as done for the free energy. The matrix L = (Lii) j-0 
then remains symmetric and positive semi-definite. Moreover, the condition 
pena Lij(c,¢) = 0, 1 < j < N, remains satisfied. In addition to initial condi- 
tions boundary conditions are imposed which, in the isolated case, are of the 

form 
Ji(c, d, Vulc, d)) -Vest =0, OSI SN. (2.24) 
In [GNS04] the following entropy inequality is shown: 


Theorem 2.9. If the system under consideration evolves following (2.22) and 
(2.23) then it holds that 


d N M 
Hoe o) > V. (Eus — ô 5 1.6.00) . 
i=0 a=1 


If the boundary conditions (2.21) and (2.24) hold then £5(c, 6) > 0. 
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2.4 Example for calibration: binary alloy, two phases 


The framework for phase field modelling of alloy solidification presented in 
the previous subsection generalizes earlier models that have successfully been 
applied to describe phenomena like dendritic and eutectic growth. By pos- 
tulating appropriate free energies f, surface terms a and w, Onsager coeffi- 
cients Lij, and a kinetic mobility function w, for example, the models used in 
[Ca89, PF90, WMB92] can be derived (see [GNS04, St05b]). In the following 
we will exemplify the choices to model non-isothermal solidification of a binary 
alloy involving a solid and as liquid phase. For more complex cases of multiple 
phases and components we refer to the article of Nestler and Wendler. 

Let M = 2 and N = 2. According to the model of an ideal solution, the 
free energy density of the liquid phase is defined by 














2 
s LT-T; R T 
fO(T,A := Zz Sa Ci + ary” ci log(c;) — “pT lolz). 
and the free energy of the solid phase by 
TATE. Ree T 
f(T, ê) := a z T, ci + a a ci) — cT loe(F)- 


The quantities L4 and Lg are the latent heats of the pure substances A = 1 
and B = 2, T4 and Tg are the melting temperatures, Rg is the gas constant, 
Um the molar volume (supposed to be constant), cp the specific heat, and Tref 
some reference temperature, e.g., the mean value of the melting temperatures. 
In the following, the entropy differences s4 and sg between the phases will 
appear. They are defined by s; := L;/T;, i = A, B. Moreover let R := Rg/vm. 
For simplicity assume that La = s4T,4 = Lp = spTp =: 2L. 

To simplify the presentation further we now consider dimensionless equa- 
tions. Whenever thermodynamic quantities appear in the following, we will 
use the same letters but they are thought to be appropriately rescaled. In par- 
ticular we are able to set cp = 1 and Tref = 1. Interpolating the free energies 
of the pure phases with the interpolation function h(¢) = ¢ in the sense of 
(2.20) yields 








FT, c9) = (aTa -T) + ETs -T)) (61 — 62) 


2 
+ RTŠ_ cilog(ci) — T log(T). 
i=1 
Since ¢1 + ¢2 = 1 and cı + cg = 1 it is sufficient to consider ® = ġı — de 
and C = cı in order to distinguish the phases and to describe the alloy 
composition. We then have ® = 1 in the liquid phase, = —1 in the solid 


Multiscale Problems in Solidification Processes 37 


phase, and C is the concentration of component A. The free energy density 
can then be written in the form 


f(T, C, 8) = f(T,C,1- C, 42, £2) 


=4(Cs4(Ta — T) + (1 — C)sg(Tg -T))® 
+ RT (Clog(C) + (1 — C) log(1 — C)) — T log(T) (2.25) 


resulting in the internal energy density 
é(T,C,®) = $(CL4+(1—C)Lp)®4+T =: L +T. 


Setting Lo; = Lio := 0 for i = 1,2 and Loo := K(®)T? the energy flux 
becomes i 
— Loo Vuo = “RON a = —K(6)VT 


whence the balance equation for the energy reads 


č = O,T + LP = V - (K(#)VT). (2.26) 


Since Fi, = fey — E(f + fea) = Elfe — fee) = tfc we have 


T l sg — SA R 
U2 =u = a =5 3 P+ z (log(C) — log(1 — C)) 
whence i i 
SB— SA 
Vuz = Vu =( 4v “Hia aoa) 








Choosing D(®)C(1 — C) =: L11 = —L12 = —L21 = Loz with some diffusivity 
coefficient D(®) a short calculation gives 


—ð;c2 = Aer = AC = V - (D(S)RVC) 


SB — SA (2.27) 


Pii (Beca -02 


Vo) . 


Subtracting the equations for the two phase field variables ¢; and @2 yields 





ô 
0’ wD = 8 (V - (a vo, —a,Vo2)— (4,41 — 9,40)) — (W,41 —W, 42) EA Joz): 


The standard double-well potential w(ġ) := 9y¢7¢2 for some y > 0 related to 
the surface tension (see below) gives 


(W, — W,42) (45%, 42) = Syp'(@) where p(®) = (8° — 1)”. 
Moreover it holds that 


—4 (fer — fide) = -22f o = -2 (Cs4(Ta — T) + (1 — C)sp(Tp — T)). 
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The surface gradient term is set to a(¢,V¢) := y\¢1Vde — ¢2Vd2|? = 
7|¢V@?. Short calculations give 


4,46, — Ao =0, Vg — 4,742 = 2y(G1V p2 — b2V b2)(o1 — G2) = YVE. 


Finally, let € := 26, Q := F, and replace the surface energy Ty =: o 
by a temperature independent constant (i.e., replace T in that term by some 
reference temperature Tef and assume that variations of ø in the temperature 
can be neglected). Then the evolution of the phase field variable is governed 
by 


EaP = EAS — p' (8) — ZE (Cs4(T4 —-T) + (1 — C)sg(Tg — T)). (2.28) 


The model consisting of equations (2.26)-(2.28) and some additional con- 
ditions will be used in the following section to sketch the method of relating a 
phase field model to a sharp interface model and in the last section to describe 
dendritic solidification. 


2.5 Some remarks on the solvability of the phase field model 


The reduced grand canonical potential is defined to be the Legendre transform 
of the negative entropy with respect to the conserved quantities, 


wu, o) = (—s)*(c(u, $), o). 


With its help it is possible to reformulate the differential equations using (u, ¢) 
as variables (cf. [St05b]). The parabolic system then has the structure 


N 
OW ui (u, o) = V t (> Li; (Yalu, $), svu) , 
j=0 
wle, V0)Oba =V. Q Voa (¢, Vo) Tapa (¢, Vo) =W pa (¢) + Vibe (u, o) -À 


where 0 <i< N,1<a< M. When rigorously analyzing these equations the 
main difficulties arise from the growth properties of w in u and the nonlinear- 
ities involving V¢. 

An ideal solution formulation of the free energy density has the structure 





f(T,c) = Tlog(T) + ae cilog(ci) +... 


As a result, in Y a term —log(—uo) appears. In particular, when solving 
the differential equations it must be shown that uo < 0 almost everywhere. 
Moreover, % is only of at most linear growth in the u;, 1 < i < N. A control 
of terms involving Y „ obtained by standard estimates for parabolic equations 
do not provide much information of u itself any more. These difficulties have 
been independently tackled in [AP92] and [LV83] respectively. 
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Based on those results, the above system including the phase field vari- 
ables is analyzed in [St05b] by approximating ~ with a perturbed potential 
of quadratic growth in u. The main task is to derive suitable estimates and, 
based on the estimates, to develop and apply appropriate compactness argu- 
ments in order to go to the limit as the perturbation vanishes. It is assumed 
that the matrix of Onsager coefficients L = (Lij)ij is positive (on a certain 
subspace) uniformly in their arguments. If a degenerating coefficient matrix 
is considered as in the previous subsection it may be better to switch to (T, ĉ) 
or (e, €) as variables, e.g. cf. [Ec04a]. 

Managing the phase field variables is kept simple in [St05b] by appropriate 
assumptions on a, w, and w. The interesting case of a involving the terms 
baV oe — 3Vea (which give a good approximation of the direction of vag) 
is still open. Non-local models have been considered by multiple authors (for 
instance, we refer to [BS96, SZ03, KRS05]). There, the energy is the only 
conserved quantity, and the difficulties with the logarithmic term in uo are 
tackled by performing a Moser type iteration to get L°°-bounds for ug and 
1/uo . 


3 Relation between the approaches and calibration 


The relation between the phase field model and the free boundary problem 
presented in the previous chapter can be established using the method of 
matched asymptotic expansions. Generalizing methods developed in [CF88, 
Ca89, BGS98, GNS98] this has been done in [GNS04]. The procedure is as fol- 
lows: It is assumed that the solution to the phase field model can be expanded 
in ô-series in the bulk regions occupied by the phases (outer expansions) and, 
using rescaled coordinates, in the interfacial regions (inner expansion). Given 
suitable relations between the functions and parameters of the phase field 
model on the one hand and the parameters in the free boundary problem on 
the other hand the functions to leading order of the 6-series solve the gov- 
erning equations of the free boundary problem. It should be remarked that 
this procedure is a formal method in the sense that it is not rigorously shown 
that the assumed expansions in fact exist and converge. But in some cases 
this ansatz could be verified (cf. [DS95, St96, CC98, Di04]). 

If the phase field model is considered as an approximation of the free 
boundary problem fast convergence with respect to ô is desired. An improve- 
ment of the approximation was obtained in [KR98] in the context of thin 
interface asymptotics. The analysis led to a positive correction term in the ki- 
netic coefficient of the phase field equation balancing undesired terms of order 
6 in the Gibbs-Thomson condition and raising the stability bound of explicit 
numerical methods. Besides, the better approximation allows for larger values 
of 6 and, therefore, for coarser grids. In particular, it is possible to consider 
the limit of vanishing kinetic undercooling which is important in applications. 
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Numerical simulations of appropriate test problems reveal an enormous gain 
in efficiency thanks to a better approximation. 

In [A199] the analysis was extended to the case of different diffusivities in 
the phases and both classical and thin interface asymptotics were discussed. 
By choosing different interpolation functions for the free energy density and 
the internal energy density (the function h in (2.20)) an approximation of 
second order could still be achieved but the gradient structure of the model 
and thermodynamic consistency were lost. Based on those ideas it was shown 
in [An02] that even an approximation of third order is possible by using high 
order polynomials for the interpolation. In [MWA00] an approach based on 
an energy and an entropy functional was used providing more degrees of free- 
dom to tackle the difficulties with unequal diffusivities in the phases while 
avoiding the loss of the thermodynamic consistency. Both classical and thin 
asymptotics are discussed in that article as well as the limit of vanishing ki- 
netic undercooling. In a more recent analysis in [RB*04], a binary alloy also 
involving different diffusivities in the phases was considered and a better ap- 
proximation was obtained by adding a small additional term to the mass flux 
(anti-trapping mass current, the ideas stem from [Ka01)). 

We have shown in [GS06] that, for two-phase multi-component systems 
with arbitrary phase diagrams, there is a correction term to the kinetic coeffi- 
cient such that the model with moving boundaries is approximated to second 
order in the small parameter 6. A new feature compared to the existing results 
is that, in general, this correction term depends on temperature and chemical 
potentials. Indeed, up to some numerical constants, the latent heat appears in 
the correction term obtained by Karma and Rappel [KR98]. Analogously, the 
equilibrium jump in the concentrations enters the correction term when in- 
vestigating an isothermal binary alloy. But from realistic phase diagrams it is 
obvious that this jump depends on the temperature leading to a temperature 
dependent correction term in the non-isothermal case. 

In this chapter, the procedure to get an second order approximation will be 
sketched for a simple model describing solidification of a pure substance. The 
model is based on the model in Sect. 2.4. There, the small quantity € = 26 
was introduced and will be used instead of 6. In addition to the free boundary 
problem which appears as problem to leading order a correction problem to 
the next order is derived by continuing the asymptotic analysis. The goal is 
to obtain that fields identically zero solve the correction problem. It turns out 
that the above mentioned correction term to the kinetic coefficient is neces- 
sary to allow for this solution. The model equations including assumptions, 
asymptotic expansions, and matching conditions are listed in the following 
subsection. After, the asymptotic analysis is performed. Finally, the leading 
order problem and the correction problem are stated. In [GS06], numerical 
tests have been performed to show that a better approximation of the free 
boundary problem thanks to the kinetic correction term is really obtained. 
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3.1 The simplified model and assumptions 


In order to present the main ideas to obtain a second order approximation a 
simple model for solidification of a pure substance is considered, namely, the 
model in Sect. 2.4 where we set C = 1. In the definition of the free energy 
density (2.25) ® is replaced by a term h(®) with an interpolation function 
h : [=1,1] — [0,1] which is symmetric, i.e., h(-—&) = —h(G), and fulfills 
h'(+1) = 0. For s4 we simply write s, and T4 is replaced by Tm. The kinetic 
coefficient splits into a main part and a positive correction term of order €, 
i.e., @ = ag + ĉa. The correction term will later be determined and turn out 
to be crucial to get a higher order approximation of the related free boundary 
problem. The heat diffusivity K is assumed to be independent of the phase 
field variable. The governing equations then have the form 


E (ao + €01)0,6 = E AS — p'(H) — 2 (s(n —T))h'(9), (3.1) 


OT + LOh(®) = KAT. (3.2) 








To obtain a well-posed problem initial and boundary conditions have to be 
imposed. Consider a domain 2 C R? and a time interval Iy := (0,7). For 
E > 0 let (T(t, x; €), O(t, 2; €)), x € R, t € Iz, denote smooth solutions to 
(3.1)-(3.2) given the same initial and boundary conditions. We suppose that, 
for all times, there exist two phases separated by a diffuse interfacial layer 
which is bounded away from the boundary of the domain N2. Here, we do not 
carry out the asymptotic analysis for the initial and boundary conditions but 
only give some remarks. That analysis is carried out in [St05b], Sect. 3.2, and 
[GS06]. 

The following procedure of matching asymptotic expansions is outlined 
with great care in [FP95, DW05]. Here, only the main ideas for the two- 
dimensional case are sketched. The family 


r(t; £) := {xE R: Ot,2;£) =0}, €>0, te Lr, (3.3) 


is supposed to be a set of smooth curves in 2. They are demanded to be 
uniformly bounded away from 02 and to depend smoothly on (£E, t) such that, 
if € N 0, some limiting curve T(t; 0) is obtained. With !(t;€) and 2%(t;€) 
we denote the regions occupied by the liquid phase (where (t, x; £) > 0) and 
the solid phase (where &(t, x; £) < 0) respectively. 

Let y(t, s;0) be a parameterization of I'(t;0) by arc-length s for every 
t € Ir. The vector v(t, s;0) denotes the unit normal on I’(t;0) pointing into 
'(t;0), and T(t, s;0) := Os7(t, s;0) denotes the unit tangential vector. For £ 
small enough the curves I(t; €) can be parametrized over I'(t;0) using some 
distance function d(t, s; £), 


y(t, s; E) := y(t, 830) + d(t, s; €)v(t, s; 0). 


Close to € = 0 we assume that there is the expansion d(t, s; £) = ¿tdi (t, s) + 
£7 do(t, s)+ O(E?). Also the curvature x(t, s; £) and the normal velocity v(t, s; £) 
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of T(t; £) are smooth and can be expanded in €-series (cf. the Appendix of 
[GS06]): 
r(t, 5; €) = K(t, s; 0) + €(K(t, 8:0)*di(t,s) + Ossdi(t,s)) +O), (3.4) 
u(t, 5; €) = a(t, $; €) - v(t, s; E) = v(t, 8:0) + £8°dı (t, s) + O(€*). (3.5) 
Here, 0° denotes the normal time derivative, see (A.1) for a definition. 


We suppose that in each domain Æ C R? such that Æ C 2\I(t;0) the 
solution can be expanded in a series close to € = 0 (outer expansion): 


K 
T(t, æ; £) = > C?6,(t,2) + OEH), 

ma (3.6) 
D(t, x; €) = X E prlt, x) + O(I). 

k=0 


Let z be the z-scaled signed distance of x from I(t; 0). Hence, in a neigh- 


borhood of I'(t;0) we can write for z 4 0 
T(t, 8,236) =T,a(t,s,2);6), F(t, 8,2; €) = S(t, a(t, s, z); £). 


An essential assumption is now that T and Ê can be expanded in these 
new variables (inner expansion), 


T(t, s,z;€) = 2 Ty(t, s, z) +O(E**"), (3.7) 








H(t, (t, 8, 2;€) = 2 Blt, s, 2) OEE +H), (3.8) 


and that these expansions are valid for z € R. The notion is that, since the 
interfacial thickness scales with €, one can expect a meaningful convergence 
behavior when rescaling the space with 1/€ in the normal direction. 

Given x ¢ I(t;0) clearly z(t,x) = dist(a, r (t;0))/E& + +o as EN 0. 
On the other hand, in that limit x is located in one of the two phases, and 
the closer it lies to the interface ’(t;0) the better the series of the functions 
6,(t,2) approximates the value of the temperature on the interface. These 
facts are reflected by the following matching conditions relating the outer and 
inner expansions (see [St05b], Sect. 3.1, and [GS06] for the derivation): As 


Zz — TOO 




















To(z) ~ 40(0™), (3.9) 
Ti(z) ~ 01(0*) + (V09(0*) - v)z, (3.10) 
-Tı (z) ~ V40(0~) “yy (3.11) 
0.T2(z) ~ VO1(0") -v + ((v: V)(v + V)00(0*)) z (3.12) 


Multiscale Problems in Solidification Processes 43 


and analogously for ®. Here, for a function g(t, x) = g(t, s,r) with the signed 
distance r = dist (a, T(t; 0)) 


t) := lim ĝ := lim g(t : 
g(0") lim g(t, s.r), g(07) lim ĝ(t, s.r) 


3.2 Outer solutions 


Away from T(t; 0), i.e., in domains E C R? with E C Q\I(t;0), the expan- 
sions (3.6) are plugged into the differential equations. All terms are expanded 
in €-series. 

To leading order €° equation (3.1) yields the identity 0 = —p' (po). The only 
stable solutions are the minima of p, hence yp = +1. These values distinguish 
the two phases because, since the result is independent of €, necessarily yp = 1 
in Q! and yo = —1 in 2°. 

To the next order £! the identity 





2 
0 = —p" (vo) y1 — zgr — 0o)h (po) 








follows. By h'(+1) = 0 and p” (+1) = 4 we obtain yı = 0 as the only solution. 
The energy balance equation (3.2) yields the heat equation, to leading 
order for #9 and to the next order for 01: 


0, = KAO, k=0,1. 


Observe that it is possible to replace ĝo by the internal energies e (0o) = 
o + L of the liquid phase or e*) = bo — L of the solid phase. 

The initial conditions and boundary conditions on O02 are independent of 
€ and, hence, only enter 69 and yo respectively. The higher order corrections 
fulfill homogeneous initial and boundary conditions. Boundary conditions on 
I'(t;0) will be obtained by matching the expansions with the expansions in 
the interfacial region. 


3.3 Inner solutions 


Derivatives with respect to (t,x) transform into derivatives with respect to 
(t, s, z) as follows: 


— = — 200, + 8? — (8° dı)ð; + O(8), 


£ 
T (O,d1)°0-. = 2051052 = (K?(z + dı) ig Ossd1)Oz oe Oss F O(E). 
The phase field equation first yields 


0 = z-o — p'(Go). (3.13) 
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By (3.3) and the assumption that (3.8) holds true for € = 0 we have @o(z = 
0) = 0. The matching conditions (3.9) imply 














olt, 8, z) > polt, s;0~) = +1 as z > too. 


Hence, the solution to (3.13) is ®o(t, s, z) = tanh(z) and only depends on z. 
For the conserved variable we get 0 = K0zzTp to order €~?. By the match- 
ing conditions (3.9) To has to be bounded as z — -too, hence we see that To 
must be constant with respect to z which means To = To(t, s). The matching 
condition (3.9) furthermore implies that To(t,s) is exactly the value of @ in 
the point y(t, s;0) € T(t; 0) from both sides of the interface. In particular, 





o is continuous across the interface T(t; 0). 


To order £! equation (3.1) yields 


2 
—agv0,Po = Oz,P1 = kô- Bo — p” (Bo) Py = 5g sm — To)h' (80). (3.14) 
oO 








From the outer solutions we have y(t, s,0=) = 0 and Vyo(t, s,0*)-v = 0. 
Due to the matching condition (3.10) we conclude ® (t, s, z) — 0 as z — boo. 
The operator £(@9) = zz — w” (Bo) is self-adjoint. Differentiating (3.13) with 
respect to z we obtain that 0,®p lies in the kernel of L(®o). Since y(—z) = 
—®o(z), -Bo and h’(@o) are even, (3.14) allows for an even solution. In the 
following we will assume that #1 indeed is even. 

A solvability condition can be deduced by multiplying the equation with 
-Po and integrating over R with respect to z: 





2 
0 = I (( — agv)(O,Bp(z))? + 3g Tr — Ta)W(Bo(2))0-0(2)) dz 
R oO 
4 
= (rs —aov) + zgr — 60) (3.15) 
where we used that f,(0,®o)?dz = $. Up to the factor $ this is the Gibbs- 
Thomson condition (2.6). 
The system (3.2) becomes to the order €71 
—v0;(To + Lh(@o)) = K0,.T}\. 


Integrating two times with respect to z furnishes 





f= -ef h(@o)d2! + (vTo — A)2) +7 (3.16) 


~ -L (0+1) - A)z- vLH) +7 as z > œ 





ve 


a (om —L) - A)z - vLH) +7 as z => —00 
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where A and 7 are integration constants and 


co 0 
oe J (1 — EE (1 + h(Go(2')))ae’. 
0 —co 
Here, we used the fact that Po converges to constants exponentially fast, so 
that the integral ai has been replaced by i while the linear terms remain. 
By (3.10) 





6,(0+) =7+ —LH (3.17) 
K 
which means, in particular, that 


0ı is continuous across I(t; 0). (3.18) 





With (3.11) and To = 69(0*) the following jump condition is obtained on 
I(t; 0): 





[-K Vo], -v = (v(To + L) — A) — (v(To — L) — A) = vle(8o(0))]s. (3-19) 
Since So only depends on z the phase field equation to order £? gives 
a Qov; ®ı i a1v0,Po = ao(ð°d1)ðz Bo 
a OzzP2 = p' (Bo) Po T (O,d1)0z2.Bo _ (K?(z T dı) + Ossd1)0,Po 
1 2 2 
— 60,8, — =p” (Bp) (1)? + —s(Tm — To)h” (Go) ®, + —sTyh' (Go). 
2 30 30 
To guarantee that @2 exists there is again a solvability condition which is 
obtained by multiplying with 0,4 9 and integrating over R with respect to 


z. The terms involving ®; vanish. For this purpose, equation (3.14) and the 
assumption that ®ı is even is used. Let 





Js= 4 0.(h o So) (z) fa — (ho ®)(z2'))dz'dz 
0 0 
=f a-(hoGo)(2) f (1 + (ho Bo)(z'))dz'dz. 


Using (3.16) to replace Tı and, after, (3.17) to replace 7 a short calculation 
shows that the solvability condition becomes (remember that 2L = sTm) 


0= a(—apd° + Os5 + &7)dy — s01 
1 
+o(- oa, + (H + J) Ts”). (3.20) 
We remark that 0°d; and (ss +7)d; are the first order corrections of the nor- 
mal velocity and the curvature of I(t, s;€) (see (3.5) and (3.4) respectively). 


Indeed, when inserting the expansions for T = 69 + £0, +... and the interface 
distance d = €d,+... into the Gibbs-Thomson condition cav = ok+8(Tm—T) 
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then, to leading order, we get (3.15), and the first line of (3.20) is the equation 
to first order in €. 

The goal is to obtain that 6; = 0 and dı = 0 are solutions to the equations 
they have to fulfill. For this purpose, the second line of (3.20) must vanish. 
But by suitable choice of the additional correction term a , in the kinetic 
coefficient, namely 


are (H+ J) ZT ns”, (3.21) 


this in indeed ensured. 

Analogously to the above correction to the Gibbs-Thomson condition we 
are interested in deriving a first order correction to the jump condition (3.19). 
The equation (3.2) yields to order €° 


= vð: (Tı + Lh'(@o)@1) + (0° _ (0°d1)0-)(To + Lh(@o)) 
=K (022T> = K0,T) + OssTo) . 


Integrating once with respect to z leads to 


— K0.zT2 = v(Tı + Lh'(@o)@1) —B 
a” 
(4) 
+ (—0° + (0°d1)0.)(To + Lh(®o))dz' = KKT +K0e5Toz 
0 =“ 
(ii) 


where B is an integration constant. We need to collect the terms contributing 
to V6, - v. In view of (3.12) this means that the terms linear in z are not of 
interest. Applying (3.10) to 1, Tı and by the assumption h’(0) = h’(1) = 0 
it holds that 





(i) ~ v6, — B+(...)z as z > too. 
Furthermore, since 6° ®o = 0, 


(ii) = —8° (To  L)z + (0° dy) L(h(Go))|5 
~oe Oa + (8° dı)L as Zz > ©, 


~ 8e — (8°di)L as z => —oo. 








By (3.10) and (3.18) we get (iii) = kK01 + (...)z as z — too. Finally, the 
first order correction of the jump condition (3.19) at the interface is 


[—KV6,]) -v = v0, + 2L(8°d,). 
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3.4 Summary of assumptions and stated problems 


Let us now collect the equations. First, the problem to leading order is stated: 


(LOP) Find a function 09 : Ir x R — R and a family of curves 
{T'(t;0) hrer, separating Q into two domains 2 (t; 0) and 2°(t;0) such 
that 

Ae) (bo) = KAO, in 2°(t;0),t € Ir, p= 8,1, 


and such that on I’(t;0) there holds for all t € Iy: 
Oo is continuous, 
[-K Vo], < v = vle(9o)]5, 
oaov = 0k + s(Tm — 00). 
If we define a, as in (3.21) then the correction problem reads as follows: 


(CP) Let (00, {T (t; 0)}+) be a solution to (LOP). Let I(t) be the length 
of I'(t;0) and set Sr, := {(t,s) : t € I7,s E [0,1(¢))}. Then find 
functions 0; : Iy x R — R and dı : Sr, —> R such that 


0,6, = KAO, in 2°(t;0),t Elz, p=, 1, 
and such that on I'(t;0) there holds for all t € Iy: 


0, is continuous, 
[-—KV0,]) "V vO, + (0° dı) [e(Ao)]é 
aay O°dy = o(Os5 + K”)dy — 801. 


Obviously, (61, d1) = 0 is a solution to the correction problem (as previously 
remarked, the boundary conditions on 02 are homogeneous). If this solution 
is unique then the leading order problem is approximated to second order 
in € by the phase field model. Problem (CP) is in fact the linearization of 
(LOP), i.e., the problem resulting from (LOP) when inserting the expansions 
T = 09+ 60,+... and d = €d,+.... We point out again that the choice (3.21) 
is crucial in order to guarantee that the undesired terms in (3.20) vanish. 


3.5 Numerical example 


In [GS06] several numerical tests have been performed revealing that the 
free boundary problem can indeed be better approximated by the phase field 
model with the correction term. Fig. 3.1 shows the results for an undercooled 
binary alloy (the potentials, physical parameters, and initial values are pre- 
cisely stated in [GS06], Sect. 4.3). A planar solid-liquid front moves into the 
liquid phase. On the right the figure shows the profiles of the concentration 
of one component during the solidification. The position of the interface, i.e, 
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Fig. 3.1. Numerical test of the correction term. On the left: the position of the 

interface depicted over €. On the right: profiles of the concentration c during evolu- 

tion. 


the point where (t, x; £) = 0, is depicted on the left for several values of £, 
the other parameters being fixed. 

Simulating with the correction term (3.21) in the phase field equation and 
varying € the changes in the interface position turned out to be of about 1073 
which is smaller than the grid spacing Ax = 0.02. In contrast, if the correction 
term was not taken into account changes of several grid points were observed. 
This behavior in € indicates that the approximation of the sharp interface 
solution is improved thanks to the correction term. 


3.6 Remarks on the multi-phase case 


When multiple phases are present the asymptotic analysis leads to a leading 
order problem consisting of the equations (2.3)—(2.6), (2.9), and (2.10) (cf. 
[GNS98, GNS04]). Indeed, the procedure presented in the previous subsec- 
tions yields the equations (2.3)—(2.6). To obtain the force balance (2.9) (and, 
analogously, (2.10)) it is assumed that, away from the triple junction on a 
diffuse phase boundary, the situation is just as in the case of two phases. 

Aiming for a second order approximation of the force balance we observed 
that, in general, in the interfacial regions not only the phase field variables of 
the adjacent phases are present but also phase fields corresponding to other 
phases appear. It turned out that these artificial third phase contributions do 
not trouble the first order asymptotic analysis but a second order analysis. 
As a first step we therefore developed and analyzed suitable multi-well poten- 
tials w that avoid the third phase contributions (cf. [St05a, GHS06]), smooth 
potentials as well as potentials of obstacle type. 

As an additional feature, the calibration of the phase field model with 
respect to given surface energies cag(v) and mobility coefficients mag(v) be- 
comes much simpler. It is shown in [BBR05] that the T-limit of (2.19) as ô — 0 
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has the form of the surface contribution in (2.12), and a relation between the 
Cap and the functions a and w is derived. Using matched asymptotic expan- 
sions, [St91] for the isotropic case and [GNS98] for the general case proposed 
the simpler relation 


sast) =int { | Vopr Evd, 
p € C®™1{([-1, 1]; 2"), p(-1) = ea, p(1) = ea}. (3.22) 


Using numerical simulations they got evidence that this formula seems to hold 
true for a large class of anisotropies. 

The new potentials w are such that solutions to (3.22) exist with p; 4 0 
only if i = a, B. Moreover, it is possible to adapt coefficients in w and cali- 
bration functions in a such that the integral in (3.22) becomes a given surface 
energy. Similarly, the relation between the mag(v) and w(¢,V¢) becomes 
much simpler thanks to the new potentials. 


4 A homogenized two-scale model for a binary mixture 


In this section we apply the theory of homogenization to a simplified physical 
situation with periodic equiaxed dendritic microstructure which is described 
by a phase field model for a binary alloy. The resulting model will be a two- 
scale model that consists of a macroscopic heat equation and of microscopic 
cell problems that describe the evolution of the phases and the microscopic 
solute transport at each point of the macroscopic domain. In order to justify 
the formal asymptotic expansion, an estimate is established that compares 
the solution of the two-scale model to that of the original model. 

The phase transition problem to be considered is given by equations (2.26)— 
(2.28), i.e., 


AT + LO — V - (K()VT) = 0, 
aC — V - (D1(®)VC) — V - (D2(C, 6) VS) = 0, 
= 0, 


a£?’ — E AD + p' (D) + q(T, C, D) (4.3) 
to be solved in the time-space cylinder Qro := Ir x Q with time interval 
Ir := [0,7] and domain 2 C R2. The diffusion tensors are assumed to be 


Lipschitz-functions of the phase field ®, they shall be symmetric, Kij = Kji, 
Dy ij = D154 for i, j = 1,...,d, as well as elliptic and bounded, 


ko|z|? Ss Kj %25 < ky |z|*, do|z|? < Dy 5 2412; < d,|z|? (4.4) 


for all z € R? with positive constants ko < kı and dọ < dı independent of 
®. Here and in the sequel, the sum convention is used. The function Də : 
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Fig. 4.1. Periodic microstructure 


R? — R¢4 is Lipschitz and bounded. The function p represents the double- 
well potential p(®) = 4(8? — pe and q : R3 — R is a Lipschitz function. 
The differential equations are supplemented by Dirichlet conditions for the 
temperature and homogeneous Neumann conditions for concentration and 
phase field, 


T = Tive, (Di(®)VC + Do(C, ®)V®) -Vest =0, VB- Vest =0 (4.5) 
on Sro := Iy x OL, and by initial conditions 
T (0, -) = Tibes C (0, -) = Cic and (0, -) = Dic (4.6) 


on 2. For simplicity of the notation the Dirichlet condition and the initial 
condition of the temperature are given by the same function Tip, that is defined 
on Qro. 

Let us introduce some notation for function spaces. Spaces of functions 
with continuous derivatives of order 8 are denoted by C%(Q), L,(Q) is the 
Lebesgue space of functions whose r-th power has an integral, W} (Q) is the 
Sobolev space of functions with derivatives of order k whose r-th power is 
integrable, and H? (Q) = we (Q). In anisotropic spaces of the type C*“(Ix Q) 
or W*“(I x Q) with time interval J, the index k refers to the time variable 
and £ to the space variables. 


4.1 Asymptotic expansion and the two-scale model 


To construct a model that is suitable for a very small scale of the evolving 
dendritic structures, we consider a sequence of problems of varying scale £ > 0, 
study the limit € — 0 of their solutions, and construct a limit problem that 
is valid for the limit of these solutions. This limit problem may be used as an 
approximation for situations with small but non-vanishing scale €. 

This procedure is done for an idealized equiaxed dendritic microstructure 
that consists of equiaxed crystals growing at the nodes of a uniform grid with 
edge length given by the scale parameter £, see Fig. 4.1. This situation is 
generated by the initial data 

Tipe (e) = Tipe (e), CR (1) = Cf.) (#, 2) and SE (a) = 8,2) (47) 


ibc ibc ie 
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with functions T € Lo(2), C®,8® € Lo(;Cy(Y)). The domain Y is a 
unit cell, by definition this is a bounded, simply connected Lipschitz domain 
with the property that R? can be represented as union of shifted copies of Y 
with no intersection of their interiors. For simplicity of the presentation, the 
volume of the unit cell is scaled to one. The standard example for Y is the unit 
cube Y = [0,1]¢. The set C(Y) contains all periodic continuous functions 
in R? with periodicity cell Y, the subscript # indicates periodic boundary 
conditions with respect to y € Y. Condition (4.7) describes instantaneous 
nucleation at time t = 0 of a periodic distribution of nuclei. In order to obtain 
a well-defined asymptotic limit for € — 0, it is necessary to scale some given 
data in dependence of ¢. Here it is assumed 


E=e€&), a=e ag, and De = en, C= 139) (4.8) 


The scaling of € is obvious: if the size of a solid crystal is proportional to 
£, and if we model this crystal by a diffuse interface model, then the width 
of the diffuse interface must be bounded by const -€ with a constant that 
is small compared to the size of the crystal. Hence éo is a small phase field 
parameter that is fixed in the asymptotic expansion. The relaxation parameter 
a is scaled such that the total relaxation factor a€? in the phase field equation 
remains constant. The scaling of the solute diffusivity is motivated by the fact 
that dendritic structures are created by a competition between a diffusional 
instability and surface energy. At least one of the diffusivities K or Di, D2 
has to be scaled in dependence of £. Since solute diffusivity is usually smaller 
than heat conductivity, it is natural to scale Dı and D2. The fact that Dz and 
€ are both scaled proportional to £? does not indicate that they are of similar 
size: in fact we expect D® to be of the size 1 and &9 to be small compared 


to D® , but the relation De/£ is kept fixed. 
In order to study the limit ¢ — 0, the existence of an asymptotic expansion 


ue(t, £) = uo (t,x, £)+eus(t, T, 2) +e°ua(t, x, £)+. -- forw=T,C,® (4.9) 


is assumed. The existence of such an asymptotic expansion is not guaranteed. 
The result of the calculation will be justified in the next section. The gra- 
dient of a function x +> u(x, #) is given by Vu = Vou + +4Vyu, where Vz 
and V, denote the gradients with respect to the first and second variables 
of u, respectively. The asymptotic expansions (4.9) and the formal relation 
V = Vz + +Vy are used in the differential equations (4.1)-(4.3). Then the 
coefficients of different powers of € are compared, starting from the lowest 
order. For the -dependent conductivities we use a Taylor expansion that is 
abbreviated by Ke = Ko +¢K1+e7Ko+-:+ with Ko = K (o) and analogous 
expansions for D® (8), DP (C, 8). The validity of these expansions with a re- 
mainder of order £f requires K, p® eof (R; R42) and p® e€ ce (R?; R24). 

The problem of 1st order consists of the terms of order ¢~? in the heat 
equation (4.1); these are 
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—Vy-(KoVyTo) = 0 in Qroy := Ir x Q xY, 
To is Y-periodic with respect to y. 


The solutions of this problem are constant with respect to y, hence To(t, x, y) = 
To(t, x) is independent of y. 

The problem of 2nd order is given by the terms of order e7! in the 
heat equation, 


—Vy-(Ko(VyTi + VeTo)) = 0 in Qroy, 
Tı is Y-periodic with respect to y. 


This is a linear elliptic equation for Tı with right hand side defined in terms 
of To. Its solution can be represented by 


TG, ay) = 


Me 


H;(t, £, y) z; To(t, x) 


j=l 


with the solutions H; of the local cell problem 
—Vy,- (KoV,H;) = Vy-(Koe;), Hj is Y-periodic, 


where e; is the j-th unit vector of Rt. Both Ko and H; depend on ®o. 
The problem of 3rd order consists of the terms of order e° in the heat 
equation, the diffusion equation and the phase field equation, 





OT + LOD — Vy: (Ko(VyT2 + VTi) + Ki(VyTi + V2To)) a 
-Vz + (Ko(VyTi + VeTo)) = 0, 

3Co — Vy - (DO(Go)VyCo) — Vy - (DOXCo,Fo)Vyho) = 0, (4.11) 

aota At Bo — E Ay Lo ne (Bo) + a(To,Co, Po) = 0 (4.12) 


on Qrey, supplemented by periodic boundary conditions on OY for To, Co 
and o. Equations (4.11) and (4.12) do not contain any derivatives with re- 
spect to x. Hence they can be interpreted as a set of differential equations 
defined on Qry := Ir x Y for every parameter x € N. Equation (4.10) is 
transformed into a macroscopic equation for To = To(t, x) by integration with 
respect to y € Y. Due to the periodic boundary conditions the V,--term 
disappears and the homogenized heat equation is obtained, 


To + LAP — V - (K*(80)VTo) = 0 


with solid volume fraction So(t, £) := fy ®o(t, x,y) dy and the effective heat 
conductivity 


d 
Ký (®0) =f CAE dy. 


k=1 
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The effective heat conductivity Aj; is symmetric, elliptic and bounded with 
the same constants ko and kı as the original matrix K, see e.g. [JKO94] or 
[H097]. 

Let us sum up the obtained two-scale model. It consists of 


e The macroscopic heat equation 
ô: (To + LB) — V - (K*(o)VT) =0 in Qro =I7r x R (4.13) 
with boundary conditions and initial conditions 


To=T on Sro = Ir x AM and TH(0,)=TO in R. 


ibc ibc 


e The definition of the averaged phase field o(t,x) = fy Po(t, £, y) dy and 
the effective heat conductivity 


K} (80) = | Kx(®0)(5jn-+ du Hlo) dy (4.14) 


with the Kronecker symbol 6;;, via the solutions H; = H;(®o) of the local 
cell problems 
-Vy + (K(Go)(VyHj +e;)) =0 nY (4.15) 
with periodic boundary conditions. 
e The microscopic problems 


8,Cy — Vy - (DO(Go)VyCo) — Vy - (DEXCo, Go)VySo) =0, (4.16) 
aoth Po — Eb AyPo + p' (Go) + q(To, Co, o) =0 = (4.17) 
in Qty = Iy x Y with periodic boundary conditions and initial data 
Co(0,2,y) =CO (x,y), Fo(0,2,y) = OD (a, y) for y € Y. 


These equations must be solved for every point x € N2 of the macroscopic 
domain. 


4.2 Analysis of the two-scale model 


The existence of weak solutions to the two-scale model is proved in [Ec04c], 
Theorem 3.3, by a fixed point approach. Uniqueness of the solution is also 
proved in [Ec04c], Theorems 3.4 and 3.5. The results can be summed up as: 


Theorem 4.1. Let Q C RÊ be a C?-smooth domain of dimension d = 2 or 
d= 3, Y C R? be a unit cell, let K, Dı : R > R4 be Lipschitz, symmetric 
and satisfy the condition (4.4), let Tine € W}? (Qro) NH (Ir; W} (9)) with 
r>d,s>1ford=2 ands > 6/5 for d = 3, Cic, Pic € Loo (N; Weg! Y) N 
Wi (9; L2(Y)) with £ > 1 + d/2, 0 < Cic < 1, suppose D2 € Col (R2; R24) 
with D2(C,®) = 0 for C ¢ [0,1], p(®) = 4 (8? — 1)”, q: R = R is Lipschitz 
and satisfies the growth condition |q(T,C,®)| < const(1 + |T| + |C| + |8|), 
and let L, €, a be positive constants. Then there exists a unique weak solution 
(T,C,®) of the two-scale model (4.13)-(4.17). 
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An estimate for the model error is derived in [Ec04b] under appropriate 
assumptions concerning the regularity of the solutions for both the original 
model and the two-scale model. Let (T+, C, P+) denote the solutions of the 
original model (4.1)—(4.3), (4.5), (4.7) with the scaling (4.8) of the parameters 
and (To, Co, o) be the solutions of the two-scale model (4.13)—(4.17) with 
initial data T®, ce ), BO), The error estimate is done in terms of macroscopic 
reconstructions of scale £ for the solutions of the two-scale model: 


ug(t, x) := uo(t, x, x/e) for u € {T,C, 8}. 
The required regularity for the solutions of the original model is 


Tell a1/21(Q70) + [Tell or sL2(2)) + €||Cel| 23/21 (Qr2) (4.18) 


+ ellBell21/22(Q70) + ICelltea(Qra) + llPellna(qra) S const 
with a constant const; independent of £. The solution of the two-scale model 
is supposed to satisfy 

To € WH? (Qro) N HV?+9 (Iz; H1(Q)), 


4.19 
Co, Bo E Loo (2;C*?(Qry)), Valo, Vebo E Loo (2; W3? (Qry)) P 


with parameters r > d+ 2, s > d and ĝ >Q. 





Theorem 4.2. Let 2 C Rİ be a bounded Lipschitz domain and Y be a unit 
cell, let K, D® eC (R; RO) be bounded and elliptic as described in (4.4), 
po E€ Co (R?; R47) be bounded, q : R? — R be globally Lipschitz and p(®) = 


1(g2_1)”, The solutions of the original model and the two-scale model satisfy 
the regularity properties (4.18) and (4.19). Let 











Tf (t,£) := To(t, x) + eH; (t,x, €) 3z, Tolt, x) 
be the first order term in the asymptotic expansion for the temperature. Then 


Zz — Tolle. azra + WCe — Colle oarr + lle- ollie. ar L2) 
+ ||Te — Ti llesc7;42(9)) £ const en 
with const independent of e. 


This theorem guarantees the order of approximation e!/? for the two-scale 
model. The exponent of e is limited to 1/2, because the two-scale model 
does not approximate the original model of scale ¢ close to the boundary of 
the domain. It must be expected that the domain of an equiaxed dendritic 
crystal growing close to the boundary is not a full shifted copy of eY, but 
a subdomain obtained by intersecting with 2. This generates an additional 
error of order e!/?. 
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Fig. 4.2. Macroscopic domain of the numerical example 


4.3 Numerical example 


In order to illustrate the two-scale model we present the results of numeri- 
cal computations for two space dimensions. The computations are done with 
constant heat conductivity K = 1 — hence no elliptic cell problem must be 
solved, — constant solute diffusivity Dı (8) = 1, D2(C,®) = —0.05 and latent 
heat 2L = 1. The function q in the phase field model is given by 
q(Vy®,T,c,®) = (1 — $?) -1.2- arctan (tzste;a (T +10- C- 48 — 2)). 
The quantity o here is correlated with the surface tension for the sharp in- 
terface limit € — 0. Its dependence on V,@ is introduced in order to describe 
the dependence of the surface tension on the orientation of the surface. The 
problem can be reformulated in terms of the function u = 10. C — i8 =2 
that plays the role of a chemical potential; the diffusion equation then takes 
the form 
(ut 48) — Ayu = 0, 


and the constitutive function q in the phase field equation is 


q(Vy®,T, u, 8) = (1 — P?) -1.2- arctan (p35 (H + T)). 


The precise form of o(V®) is 
a(V®) = 00(1 — (m? — 1)01 cos(m(O(V®) — Oo))), 


where go describes the average value, cı is the strength of the anisotropy, m 
describes the symmetry pattern of the dendrites, O(V®) is the angle between 
V@ and the x1-axis and Oy is an offset angle. The special choice of q involving 
the arctan function is chosen in order to ensure that the minima of the po- 
tential for fixed T, u, Vy® are kept at ® = +1, even for large deviations from 
the equilibrium melting point; following the proposition of Kobayashi [K093]. 

Both the global heat equation and the microscopic problems are discretized 
by bilinear finite elements on uniform rectangular grids. The equations are 
decoupled by the time discretization in the following way: first a partially 
linearized version of the phase field equation is solved with temperature, con- 
centration and V,@ taken from the previous time step, then the diffusion 
equation is solved. This is done for every grid point of the macroscopic grid, 
then the global heat equation is solved. The decoupled linear equations are 
discretized with respect to time by the Crank-Nicolson scheme. This gives 
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Fig. 4.3. Evolution of left and right crystal for Oo = 0 
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Fig. 4.4. Evolution of specific data for Oo = 0 


a semi-implicit time-discretization of the two-scale model, with an implicit 
discretization of the main parts of the differential operators. 

The examples to be presented are computed for oo = 0.0002, m = 4, 
€ = 0.005, a = 5 and cı = 0.05. The initial conditions are T = —0.1 and 
u(T,c,&) = —0.1, this adds up to a total initial undercooling of —0.2. The 
unit cell for the microscopic problem is Y = [0, 1]?, the initial solid nucleus is a 
sphere of radius r = 0.05 located at the midpoint (0.5, 0.5) of Y. The boundary 
conditions are periodic boundary conditions for the microscopic problems and 
given heat fluxes for the macroscopic equations. The macroscopic domain is 
Q = [0,2] x [0,0.5]; we prescribe homogeneous heat fluxes VT - vex: = 0 on 
[0,2] x {0}, {2} x [0, 0.5] and [0, 2] x {0.5}, on the remaining part {0} x [0, 0.5] 
of the boundary we prescribe the heat flux VT - vex, = —1. The macroscopic 
equation is discretized by a uniform rectangular grid with 19 x 1 elements; it is 
essentially one-dimensional, the crystals evolving at the same 21-position are 
equal. The microscopic problems are solved with uniform rectangular grids 
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Fig. 4.5. Evolution of left and right crystal for Oo = 0.4 
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Fig. 4.6. Evolution of specific data for Oo = 0.4 


of 300 x 300 elements. The time step is At = 2-10~°, the final time of all 
computations is t = 0.2. 

The figures show the results for three different orientations Oo = 0, Oo = 
0.4 and Oo = 7/4 for the anisotropy of the surface tension. Figures 4.3, 4.5 and 
4.7 show the evolution of the left and right crystals from the initial time t = 0 
to t = 0.2 in twenty steps. The left crystal is that growing at xı = 0, the right 
that at xı = 2. Due to the boundary cooling at xı = 0 the left crystal grows 
quickly, whereas the right one evolves rather slowly; its driving force is limited 
to the initial undercooling. For the left crystal, the offset angle Og = 0 leads 
to shorter dendrites than the other angles, here the interaction of neighboring 
crystals happens earlier than in the cases Oo = 0.4 and Oo = 7/4. This effect 
is not visible for the right crystals which are in an early stage of their evolution. 
In Fig. 4.4, 4.6 and 4.8 the evolution of the specific data (specific volume and 
specific surface) is depicted for selected crystals in a row in «1-direction, the 
number corresponds to the position of the crystal, starting with position 1 at 
zı = 0. Further examples are presented in [Ec04a] and [Ec04b]. 
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Fig. 4.7. Evolution of left and right crystal for Oo = 7/4 
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Fig. 4.8. Evolution of specific data for Oo = 1/4 


4.4 Some remarks on the numerical analysis 


Error estimates for simple finite element discretizations of both the origi- 
nal model for scale € and the two-scale model are derived and compared in 
[Ec02]. For linear or bilinear finite elements on a grid with mesh size h and 
a discretization with respect to time by an implicit Euler scheme with time 
step At, the error for the original model of scale € is 


2 
const ((2) + ar) 


This estimate reveals the typical convergence properties of the chosen dis- 
cretization for parabolic equations: convergence of second order with respect 
to the space variables and of first order with respect to time. The factor 1/e 
of the mesh size h accounts for the obvious fact that the microstructure starts 
to be properly resolved for h < e€ only. The discretization of the two-scale 
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model uses a global grid for the macroscopic heat equation defined on 2, at 
each node of this global grid the local cell problem defined on Y is solved with 
a local grid for Y. The mesh size of both grids is related to ho, the time step 
is again At. Then the error estimate for the two-scale model is 


const (hg + At) . 


Obviously no dependence on € is present here. In order to have comparable 
computational complexity, the mesh sizes ho for the two-scale model and h 
for the original model should scale according to ho ~ vh. Respecting also 
the model error of order ¢!/? for the two-scale model we conclude that the 
two-scale model is superior, if the mesh size used for the original model is 
larger than the threshold he = const3e°/* with a suitable const3. 


4.5 Conclusion 


The presented two-scale model is an approximate model for a problem with 
scale ¢ of the microstructure, with increasing accuracy for decreasing £. Nu- 
merical computations with this model are valid for a whole range of microscale 
parameters £ € (0,€o0] with the appropriate diffusivities. The model is suit- 
able for material with fast heat diffusion and slow solute diffusion, where the 
temperature is assumed to be essentially constant on the microscopic scale, 
while the solute transport is neglected on the macroscopic scale. 

Extensions of the presented two-scale model may be possible for more com- 
plex physical phenomena, for example phase transitions with convection, and 
physically more realistic situations, in particular for non-periodic microstruc- 
tures. The extension to models with convection is probably possible by the 
application of techniques similar to those presented here to available phase 
field models that include convection, see e.g. [AMW00], [BD*99], [NW*00]. 
For phase transitions with density differences between solid and liquid — where 
convection cannot be avoided — it may be necessary to use a unit cell that 
is fixed in Lagrangian coordinates but moves and deforms with the flow in 
an Eulerian description. Non-periodic microstructures can be described by a 
probabilistic description of the initial conditions, then it is possible to ap- 
ply techniques of random homogenization of the type described in [JKO94]. 
A corresponding stochastic version of the two-scale model can be found in 
[Ec04a]. 


A Facts on evolving surfaces and transport identities 


Let Ir = (0,7) C R be a time interval and let m, d € N with m < d. 
Definition A.1. (X;)er, is an evolving m-dimensional surface in R? if 


1. for each t € Iz, the surface X, can be parameterized over a fixed smooth 
orientable submanifold U C R™*?, 


60 C. Eck, H. Garcke, B. Stinner 


2. the set ©” := {x' = (t,x): t € Iz, 2 € Xi} C RxR is a smooth 
m + 1-dimensional surface, 

3. the tangent space Ta: X' is nowhere purely spatial, i.e., Ty” 4 {0} x V 
with V = R”. 


The spatial tangent space of dimension m in x € 3; is denoted by T,.2%, 
the spatial normal space of dimension d — m by N, 5; := (Tz5;)+. There 
is a unique vector field vy : ©” — R+! such that (1,vs(t,z)) € Te” 
and vy (t,x) E€ N,24; v(t, x) is the vectorial normal velocity of the evolving 
surface. It can be verified that 


Ty” = {(s, svz(a’)) + (0,7): s E€ R,T € Ta Xt}, 
Ny 2” = {(-vz(2")-v,v):v E Ng}. 


Let y be a smooth scalar field on X’. The derivative 
O(a’) := 3a ve(2’))P(2’) in a2’ = (t,£) € X’, (A.1) 


is the normal time derivative of y in 2’ and describes the variation of » 
when following the curve 6 +> c(d) € 445 defined by c(0) = x and Osc(d) = 
vz(t+ 6,c(d)), 6 € (t — ĝo, t + ĝo) with some small ôo > 0. 

Let (Tx(t,x))%_, be an orthonormal basis of Ta X+. By ôn, p(x) the dif- 
ferential of y into direction (0, T) € Tz” is denoted. The surface gradient 
of ọ in 2’ is defined by Vuy(2") := Ty, Onv(a’)te E€ Tr Xt. Let p be a 
smooth vector field on X’. The surface divergence of ~ in x’ is defined by 
Vx pla’) = | Op, ola!) T- 

If m = d—1 the normal space NX, has dimension one, and X” is ori- 
entable. Then there is a smooth vector field vs of unit normals, vy(a’) € 
Nz, |vo(a’)|2 = 1. The (scalar) curvature and the curvature vector then 
are defined by 

Kg := -V5 “VS, Ky := KEV. (A.2) 


Moreover, the (scalar) normal velocity then is defined by 
vs = V5: V5, (A.3) 
and the following relation, derived in [Gu00], Chapter 15b, holds: 
vs = -V svs. (A.4) 
Definition A.2. I” := (Ti): is an evolving m-dimensional subsurface of 2” 


if 


1. the set T, is a relatively open connected subset of X, for each t € I7, 

2. the boundary OI’ := (ƏT); consists of a finite number of evolving m — 1- 
dimensional surfaces such that, locally for each t € Ir, OI; is the graph 
of a Lipschitz continuous map. 
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A vectorial normal velocity vər can be assigned to the pieces of ôI” while I” 
obviously has the same vectorial normal velocity as ©”, namely vy. 

In some point x € OF; the tangent cone on I; is denoted by T, J}. If x is 
in the interior of one of the pieces the cone is a half-space of T,;. Besides 
then the boundary of T,/; in T,X coincides with the tangent space of the 
boundary OJ;, i.e., OT,/; = T,OL;. In such points æ there is a unique unit 
vector Tr E€ TX O Nôl; with tp - 7 <0 for all 7 € TI. This vector Tr is 
said to be the external unit normal of I; with respect to Xr. 

Let m = d—1andd < 3. First, a divergence theorem is stated for a smooth 
surface with piecewise smooth Lipschitz boundary like I; as in Definition A.2: 


Theorem A.3. ((Be86], Corollary 4 ) In the above situation there is the 
following identity: 


| (Ve: pts: ) ana) = f yp: trdH™ |. 
T; ory 


If ~ is a tangent vector field then Ks - yp = 0 so that one gets the usual 
divergence theorem on surfaces. It should be remarked that the proof in [Be86] 
is performed for smooth OI but there is a brief note on the above case of 
a piecewise smooth boundary at the end of Sect. II(2). Finally, a transport 
identity is stated: 


Theorem A.4. ([Be86], Theorem 1) In the above situation it holds for every 
te Ir that 


d a 
a (fem ) 


Remark A.5. If vs = 0 and ky = 0 then J; is flat, 0° reduces to ô; and var 
is tangential. Altogether, the Reynold’s transport theorem is obtained. 





=| (Op — pys: Ky) am f (vər tr) dH™?. 
t Ti ol: 
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Summary. Imposed by the crystal lattice, at the surface of a crystal, there exist 
atomic steps, which separate exposed lattice planes that differ in height by a single 
lattice spacing. These steps are long-living lattice defects, which make them suitable 
as a basis for the description of surface morphology on a mesoscopic length scale and 
thus are an ideal approach to overcome the different length scales, which range from 
several atoms in lateral direction to micrometers in horizontal direction. This paper 
summerizes an approach how the thermodynamics and kinetics of atomic steps can 
be coarse grained to continuum models for the evolving surface. We discuss phase- 
field approximations to the step dynamics model and apply them to various growth 
procedures. 


1 Introduction 


A deep understanding of the fundamental physics underlying epitaxial growth 
techniques is a prerequisite for any significant success in semiconductor nan- 
otechnology. The frontiers in developing novel electronic devices lie in the 
realm of the length scales from nanometers to microns. Controlling the surface 
morphology during epitaxial growth at such scales is a challenging task. In this 
regime kinetic effects on an atomistic scale strongly influence the morphology. 
Thus a continuum theory cannot describe the growth process appropriately. 
On the other hand the applicability of discrete models on an atomistic scale 
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is limited due to its high computational cost if time and length scales related 
to device applications are considered. 

As first pointed out by Burton, Cabrera and Frank [7] an intermediate 
regime between atomistic and continuum modeling can be used to combine the 
appropriate atomistic properties with the computational ease of continuum 
models. At crystalline surfaces the evolution of the steps, their advancement, 
nucleation and annihilation, can be used to describe the surface morphology of 
the growing film. This allows to break the model in a 2+ 1-dimensional model, 
continuous in the lateral directions but discrete in height. The evolution of 
steps is implicitly connected to atomistic motion via attachment and detach- 
ment at step edges and therefore allows to incorporate essential kinetic effects 
on an atomistic scale. Various step flow models have been developed which in- 
corporate more and more of the relevant atomistic effects. These models have 
proven to be extremely powerful in understanding surface morphologies and 
the formation of patterns at the nanometer and micrometer scales, see [25, 37] 
for recent reviews. During the last years powerful numerical techniques for the 
solution of these problems have been developed and make step dynamics an 
attractive alternative to atomistic modeling approaches in epitaxial crystal 
growth. Level-set based methods [9, 35, 39], front-tracking type approaches 
[3, 4], phase-field methods [30, 23, 36, 34, 53] and geometry-based simulations 
[29] have been developed. 

In this contribution we will mainly concentrate on phase-field approxima- 
tions for the discrete-continuum model. Thereby the discrete height is ap- 
proximated by a continuous phase-field variable. With this interpretation, the 
phase-field approximation can be viewed as a coarse grained continuous model 
for the underlying discrete-continuum step flow model. The contribution is or- 
ganized as follows: In Section 2 we introduce the Burton-Cabrera-Frank model 
and discuss some of its limitations. Sections 3 and 4 are devoted to two dif- 
ferent growth methods, Liquid Phase Epitaxy (LPE) and Molecular Beam 
Epitaxy (MBE), respectively. In the case of LPE, the film grows from a so- 
lution and transport mechanisms in the solution can influence the surface 
evolution and vice versa, and thus have to be considered in addition. In MBE 
the film grows from a vapour phase, here transport in the vapour does not 
need to be taken into account. For both situations we discuss a phase-field 
approximation and apply it to simulate epitaxial growth in a spiral growth 
model. 


2 Burton-Cabrera-Frank model 


In this section, we present a general BCF-like model for step-flow growth or 
island growth of homoepitaxial films. 
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Fig. 2.1. Microscopic processes in epitaxial growth of thin films. 


2.1 The BCF framework 


The Burton-Cabrera-Frank (BCF) equations [7] serve as a prototype of 
discrete-continuous step flow models, discrete in the height resolving the 
atomic layers in the growth direction, but coarse grained in the lateral di- 
rection. The steps, separating terraces of different height, are assumed to be 
smooth curves and serve as free boundaries for an adatom diffusion on ter- 
races. Mathematically such a model is a free boundary problem that consists 
of a diffusion equation for the adatom density on the terraces, boundary con- 
ditions at the steps and a velocity law for the motion of the steps. For a 
schematic picture of an epitaxial growing surface see Fig. 2.1. 

We denote by 2 C R? the projected domain of a film surface in a two- 
dimensional Cartesian coordinate system, and assume that N2 is independent 
of time t. In addition we denote by Q; = Q;(t) C R?, i = 0,1,..., islands or 
terraces of discrete height i. We denote further the corresponding steps (or 
terrace boundaries) by 


LA = ZONE, i=1,2,.... 


Denote by c; = ci(x,t) the adatom concentration on terrace (2;(t) at time t. 
The adatom diffusion on a terrace is described by the diffusion equation for 
the adatom density 


Oc; —V- (DriVci) = -r ici + Fr — Mr in Qilt), (2.1) 


where Dp; is the adatom diffusion coefficient on a terrace £;, TT} is the 
desorption rate, Fr is the deposition flux rate onto the terrace and Mr is the 
loss due to nucleation of adatom islands. In this paper we will not address 
the nucleation of adatom islands Mr. For a detailed description on nucleation 
and a way how to include it into the step flow model we refer to [38]. The 
fluxes of adatoms to the steps are given by 


—DriVci “Ny — ViCi = fits (2.2) 
Dry-1VeGi-1 ii + ViGi-1 = fi,-, (2.3) 
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Fig. 2.2. The Ehrlich-Schwoebel barrier. 


where fi, is the net flux from the upper terrace §2;(t) and fi,— the net flux 
from the lower terrace 92;_1(t) to the boundary I(t), n; is the unit normal of 
the step I(t) pointing from the upper to the lower terrace, and v; is the normal 
velocity of the step I(t) with the convention that v; > 0 if the movement 
of I(t) is in the direction of n;. The terms vic; describe the convection of 
adatoms due to the motion of the step and are needed for mass conservation. 
The fluxes are now related to the deviation of the adatom density at the step 
from its equilibrium value 


fiz = kyla -ci er (2.4) 
fi- = k-(G-1 — ¢§"), (2.5) 


where k4 = k4(0) and k- = k_(@) are the kinetic attachment rates from the 
upper and lower terrace to the step I(t), respectively; 0 defines the angle 
between the normal and a low index crystallographic direction. For finite ki 
the adatom density is discontinuous at the steps and for k} # k- growth 
instabilities can occur, see [25] for a recent review. Step meandering is known 
for k} > k_ (step Ehrlich-Schwoebel barrier) and step bunching occurs for 
ky < k_ (inverse step Ehrlich-Schwoebel barrier). It is observed in many 
material systems that, in order to stick to a step from an upper terrace, 
an adatom must overcome an additional energy barrier (see Fig. 2.2), the 
Ehrlich-Schwoebel (ES) barrier [17, 43, 44]. Its inverse counterpart is much 
less understood. 

In the limit k4, k- — oo the adatom density becomes continuous and the 
boundary condition reduces to ci = ci-1 = c;7, where the equilibrium adatom 
density c$? is given by 








24 
ed _ o* [1 l ki 
C; c ( + Pee), (2.6) 


with y = 4(6) the step stiffness, xk; the curvature of T;(t), c* the equilibrium 
adatom density of a straight step, kg the Boltzmann constant, T the temper- 
ature and §2, the area of an adsorption site. For the motion of the moving 
boundaries, we assume the following law for the normal velocity v; of the step 
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a = fiat fi,- + Os(VOs(QaVKi)) + Os((fi,4 + fi, -)8), (2.7) 


where v = v(@) is the mobility for diffusion along the steps, 3 = G(0) is an 
effective anisotropy to model the effect of a kink Ehrlich-Schwoebel barrier. 
The kink Ehrlich-Schwoebel barrier results from an additional energy barrier 
for the diffusion of an edge-adatom around a kink or a corner. The presence 
of such a barrier affects the nucleation of kinks at the steps and generates a 
nonlinear current, which can also contribute to a meandering instability. The 
last term in (2.7) is purely kinetic and has been introduced by [20]. 0, denotes 
the tangential derivative along the steps. The term 0,(v0,(V«;)) represents 
diffusion along steps. 

All the introduced models assume implicitly a close-to-equilibrium situa- 
tion, which is not fulfilled in many applications related to epitaxy. This results 
from the linear dependency of the flux f;,4 on the equilibrium adatom density 
cs? in the boundary conditions (2.4) and (2.5). Step flow models without this 
assumption are rare [8, 1, 2]. The idea behind these models is a mean field 
approach, in which an atomistic description of the steps is incorporated into 
the discrete-continuum model by allowing for two additional quantities along 
the step: step adatoms and kinks. 

However, for the applications presented in the following several simpli- 
fications are possible: i) we can neglect nucleation Mr, ii) we can neglect 
desorption T by assuming the average time for an adatom to reach a step is 
much smaller than the time to desorb, which is fullfilled for small distances 
between steps, iii) we assume that there is one diffusion coefficient Dr for all 
terraces. Thus the system reduces to 





One; V. (DrV ci) = Fr (2.8) 

and 
—DrVc "n; = Tids (2.9) 
DrVCci—1 "n; = Jea (2.10) 


We further consider the case where the adatom concentration is continuous 
at the boundary. Thus the boundary conditions at the steps read 





NaF Vi 
sgass a 2 2.11 
Ci = ĉi—1 e (14 in) tat (2.11) 


where we have included a term av; accounting for kinetic attachment effects. 
In the velocity law we neglect diffusion along edges and the effect of the kink 
Ehrlich Schwoebel barrier. Thus we obtain 


Ui 
— = fi ean 2.12 
2, fi+ + fi, ( ) 
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2.2 Thermodynamic consistency 


As pointed out in [11] the above model is not necessarily consistent with 
the second law of thermodynamics. To ensure consistency restrictions on the 
constitutive relations (2.4) and (2.5) are required. [11] provide an alternative 
set of boundary conditions, derived from a configurational force balance, for 
which the second law holds, which corresponding to our situation reads 


fit = Ly (ua — [[wil]), (2.13) 
fi- = 1-1 — [fwi], (2.14) 


with u; the terrace chemical potential, defined through ui = ôe, Yi, with W;(c;) 
the terrace adatom free energy. Here w; = Y; — pic; is the grand canonical 
potential and [[w;]] = wi — wi-1. The jump in the grand canonical potential 
results from the discontinuity of the adatom concentration and might have 
significant influence on the stability of steps. As in the case of an ideal lattice 
gas W;(c;) is convex. If we further assume (0) = 0 and define the adatom 
equilibrium density cj? such that pi(c$1) = pwi-1(c;*) = 0 we can expand pu; 
and w; close to c? to obtain in leading order 


pa(ci) = lcg (ci — c°), (2.15) 
wi (ci) = wile) + cg ui (7) (ci — |"), (2.16) 


which inserted into (2.13) and (2.14) yield 
fit = ka les = c" — ci lle), (2.17) 
fi- = k-(ciz1 — G7 — c [lc:]), (2.18) 


with k+ = l+u;(c$1). The original form (2.4) and (2.5) is thus only obtained as 
a first-order approximation under the additional assumption that the adatoms 
are sufficiently rarified (c; < c41) such that the term c$" [[c;]] can be neglected. 














2.3 Spiral growth 


We will apply the BCF model in the following in its classical form to two 
important growth technologies and numerically investigate a spiral growth 
mode. Spiral growth remains to be an important growth mechanism for various 
materials, including compound semiconductors, see e.g. [48]. The growth mode 
is initiated from a screw dislocation from which a single step originates (see 
Fig. 2.3). Thus the growth is not limited by the nucleation rate and therefore 
allows growth also under low supersaturations [6]. Instead of the independent 
layer structure discussed so far, the crystal surface is now just one layer, which 
overlaps itself helicoidally. The crystal grows now by simply turning the screw, 
through the advancement of the spiral emanating from the disclocation [31], 
see also Fig. 2.4. 


Multiscale Modeling of Epitaxial Growth 71 





Fig. 2.3. Dislocation at a flat surface. This dislocation might trigger spiral growth. 
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Fig. 2.4. Step formation due to the pinning at a screw dislocation. 


A basic question arising in spiral growth is the prediction of the final 
surface slope, which is influenced by the interplay of diffusion processes on the 
terraces and along the steps, attachment kinetics at the steps and presumably 
also from mass transport in the bulk. 


2.4 Phase-field approximation 


A phase-field model for this situation was considered in [41], where formal 
matched asymptotic analysis was used to show the convergence for vanishing 
interfacial thickness and kinetic parameter to the sharp interface model (2.8)— 
(2.12). The phase-field model reads 





Oe. 2 zg S NaF 2 = / _ a* 
Go 3p = EST (e“ Ag — G'(d)) + ele- č), (2.20) 


where ¢ is the phase-field variable, € a small parameter determining the width 
of the diffuse interface and a a kinetic parameter. For applications in step 
flow or island growth G(¢) is a multiwell potential with minima at the dis- 
crete heights of the terraces. A more general phase-field model incorporating 
finite kinetic coefficients k4, k— has been introduced in [34]. An approxima- 
tion in which the effect of edge diffusion is considered was discussed in [42] 
and a combination of both and thus the general case described in Sect. 2 is 
considered in a combined phase-field level-set approach in [21]. 

In order to adapt the phase-field model to the special situation of a spiral 
growth mode we follow the approach in [23]. We define the multiwell potential 
as 


G(¢) = H(¢ — $0) 


with ġo representing the screw dislocation and defining an initial value; 27¢9 
defines the angle of (x,y) in the x — y plane. The phase transition happens in 
radial direction and ¢ measures the continuous height of the spiral. H is now 
defined through 
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H(p) = (k+1-—)?(k- )?, welk,k+1], kez. 
We use no-flux conditions on the boundary of the domain Q = (—L/2, L/2)? 
nc = Ond = 0. 


3 Application in Liquid Phase Epitaxy 


Liquid Phase Epitaxy [46] is the name for an epitaxial technique, which com- 
bines solution growth and epitaxy taking advantage of the potentialities of 
both. Its development was given impetus from (a) technical requirements for 
thin (1 to 5 um) garnet films, and (b) its ability to grow pure semiconductor 
compounds and their alloys. Later it was developed further to manufacture 
thin-film devices using compound semiconducting materials. It proved tremen- 
dous superiority to the early attempts to fabricate such thin films by means 
as e.g. polishing bulk crystals. Since then it has been successfully applied in 
many laboratories around the world. Moreover, it allowed to obtain funda- 
mental new information concerning the generic processes of crystal growth. 

Despite its large fundamental and technological impact, models of LPE so 
far are restricted to one-dimensional models predicting the evolution of the 
integrated growth rate h(t) of the epitaxial surface. First results were obtained 
neglecting hydrodynamic convection, which is a good approximation only for 
quiescent liquids. For the case of first-order surface kinetics, one can reduce 
this problem to various perturbation series [32, 27, 19, 49]. 

For LPE systems, where the assumption of a quiescent liquid is no longer 
satisfied, hydrodynamics effects have to be taken into account. Usually one 
imposes a forced flow to the substrate via a “rotating-disk” configuration. A 
rotating disk tends to draw material towards its surface in the normal di- 
rection, and then expells it radially in the manner of a centrifugal fan. If 
one applies this picture to LPE one has to take into account that the fluid- 
velocity’s normal component cannot vanish at the disk’s surface because mass 
conservation requires it to be proportional to the crystal’s growth rate [51]. 
So far, hydrodynamics have adressed this problem when the amount of “suc- 
tion” is indepentent of time. In this case the fluid flow problem is effectively 
decoupled from the mass-transfer problem [47, 52]. Numerical simulations of 
the coupled, time dependent problem are still lacking. 

Moreover, two-dimensional studies of the problem, which do not only re- 
solve the integrated growth rate but also the two-dimensional microscopic 
morphology of the substrate’s surface, are still an open issue. LPE is particu- 
larly rich in morphological features. It is tempting to apply [24, 12] the theory 
of Mullins and Sekerka [33] for an analysis of the stability of these morpholo- 
gies. But technically this is incorrect, because the unperturbed state is almost 
always time dependent. Thus numerical simulations of the full time depen- 
dent problem are essential to adress the question which processing parameters 
trigger which final stage morphology. 
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A previous approach to LPE developed by two of the authors [18] treats 
this problem assuming a quiescent solution, i.e. negecting fluid flow, but taking 
into account the evolution of the microscopic surface morphology. It is based 
on a phase-field model which allows us to consider precisely the physics of 
the underlying phase-diagram for the heteroepitaxial LPE growth process. 
Moreover, it applies to the situation, where the main driving force of the 
concentration field results from its gradients along the surface. 

In the following subsection we will focus on an extension of this previous 
approach, which takes into account also (a) adatom concentration gradients 
perpendicular to the surface and (b) hydrodynamic transport in the solution. 


3.1 A model for Liquid Phase Epitaxy 


We consider the growth of a thin epitaxial layer at the bottom of a domain 
Q C R? filled with a liquid solution that contains the adatoms as depicted 
in Fig. 3.1. The adatoms in the liquid solution are transported to the solid 
layer by diffusion and convection, there they may deposit on the epitaxial 
surface. At time t, the domain Q is partitioned into a domain Qz(t) filled by 
the liquid solution and a domain Q(t) filled by the solid phase. The interface 
S = S(t) between Qz and Qs is represented by the graph of a function h over 
the bottom surface So of Q, 


S(t) = {x + A(t,x)e3 |x € So}, (3.1) 


where e3 is the 3rd unit vector. Fluid flow and adatom transport in Qz are 
described by a Navier-Stokes system and a convection-diffusion equation, 


V-v=0 
(3.2) 
Ov +(v-V)v — náv = Vp, 
Ac’ +v- Ve" — DyAcY =0 (3.3) 


with fluid velocity v, pressure p, viscosity 1, mass specific concentration cV 


and diffusion constant Dy. The epitaxial growth is governed by a BCF-model, 





Liquid 
QL solution 





So Epitaxial layer S 
Qe 


Fig. 3.1. Liquid Phase Epitaxy 
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V S 
acl = DgAcS+—-— in So\ P(t), (3.4) 
TV TS 


cS = ~ (1+8 Ray/(kBT)), 
on T(t). (3.5) 





vr = Dg Qa | 
Here cf is the surface density of adatoms, Dg is a surface diffusion coefficient, 
Ty is the mean time for the deposition of adatoms from the solution to the 
surface, Ts is the mean time for the desorption of adatoms from the surface to 
the solution, the curve I describes the monoatomar steps, ur is the velocity 
of the steps, and Ea is the difference of the normal derivatives on both 


sides of the interface. 

The volume equations (3.2), (3.3) and the surface equation (3.4) are cou- 
pled by boundary conditions that model the conservation of total mass and 
mass of the adatoms. If oy and og denote the densities of the liquid and solid 
adatom phase, Ma is the mass of a single atom and Js is the density of the 
surface measure for the surface S, parametrized over So, then these conditions 


are 
7 1 1 V S 
pea (— z >) (= 5 =) and (3.6) 
Js \ov oE TV Tg 
aV m E Y 
Dy = S(t e") FS 3.7 
Y On Js ( i ) (< Z) ( ) 


The tangential fluid velocity at the interface S vanishes, v:(t,x) = 0 for 
x € S(t). The movement of the interface S is given by the condition 


ðh = Ze (Z = Z) . (3.8) 


OE \TV TS 


The model is completed by initial conditions for v, cV and c’, by an initial 
partition of Q into a liquid domain Qz(0) and a solid domain Qs(0) with 
interface S$(0) given by an initial condition for h(0,x), by initial steps T (0) 
on the solid layer So and by boundary conditions for (3.2), (3.3), (3.4). 


3.2 Homogenization and two-scale model 


Liquid Phase Epitaxy involves various different length scales: the epitaxial 
layer is measured in atom diameters, while the continuum models for diffusion 
and fluid flow, and also the model for the adatom diffusion on the liquid-solid 
interface, are valid for a much larger scale. The liquid-solid interface often 
exhibits a specific microstructure, whose scale is larger than that of an atom 
diameter but much smaller than that of the diffusion and fluid flow processes in 
the liquid solution. In this section we use homogenization techniques to derive 
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a model that is suitable for such situations. It is necessary to choose suitable 
scaling properties of some coefficients in dependence of a scale parameter 7 
that represents the size of the microstructure. The size of the adatoms is scaled 
proportional to 7, this leads to 

Na =n, ha =nh® and Ma = n?m? (3.9) 


a 
The epitaxial microstructure of scale 7 requires the relations 


Dg =7° D8, ty =n te, c=? and y= n ty. (3.10) 
In the limit n — 0 two phenomena must be described: small scale oscillations 
in the epitaxial microstructure, and boundary layers for the solutions of (3.2), 
(3.3) close to the interface. These phenomena require a combination of the 
standard homogenization technique for the oscillations and the technique of 
matched asymptotic expansions for the computation of the boundary layer. 
The homogenization is done for a simple periodic setting with periodicity 
cell Y C R? of area 1. A periodicity cell is a bounded Lipschitz domain with 
the property that R? can be represented as union of shifted copies of Y with 
empty intersection; the simplest example is the unit square Y = 0, 1]?. The 
corresponding initial conditions for the concentrations are 


cP (x) =n ?(x,x/n) for x € So and 
ee) = nc? (x) for x € Qz(0). 
The function c°° = c*°(x,y) is Y-periodic in its second variable y € Y. 


2 S 


The scaling of cf by n7? makes sense, because cf measures the number of 
atoms per unit area, and the area of single atoms is proportional to 77; the 
scaling by n~? keeps the area covered by adatoms constant. The scaling of cY 
is motivated by the exchange of matter between solid and liquid phase: the 
thickness of the solid phase is proportional to 7, therefore the total mass of 
adatoms that deposit on the surface is also proportional to 7. The scaling by 
7 accounts for this rate of mass exchange; it is reasonable for a dilute solution. 
The adatom density c* is expanded into the power series 


c? (t,x) = n?co (t,x, x/n) F nee (t,x, x/n) prata (3.11) 


The surface [ = I(t), its curvature xk = ky and its velocity vr = vry are 

also expanded in an appropriate two-scale sense, for details we refer to [16]. 
The outer expansions for the volume fields v, p and cV are valid far away 

from the liquid-solid interface S(t). They are given by the power series 


vn (t, x) = vo(t, x) T nvi(t,x) speek 


Pn(t, x) = po(t, Xx) ae n pi(t, x) TER ’ 
cy (t,x) = ney (t,x) + cl (t,x). (3.12) 
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The inner expansion for v,p,c" is valid close to S,(t). Since this interface is 
a priori unknown, the inner expansion is done after the transform of variables 
Š =x —h,(t,x)e3. Then v,p,c” are expanded in power series 


Vn (t,x) = vo(t,x,x/n) + n vilt,x,x/) +>, 
Py(t,x) = po(t,x,x/n) + npilt,x,x/n) +, 
ch (t,x) = ney (t,x, x/n) + n°cY (t,x, x/n) + 


This expansion combines both the homogenization by formal asymptotics 
and the inner expansion for the boundary layer. The former is realized 
by Y-periodic oscillations of the components (y1, y2) in uY (t,x,y) for u € 
{v,p,cY}, the latter via the component y3. As a consequence, the functions 
y > u,(t,x,y) for u € {v,p,cV}, j = 0,1,2,..., are defined in the do- 
main Y x (0,-+00), with periodic boundary conditions at the lateral surface 
(y1,Y2) € OY. The condition for y3 — +00 is derived by matching the inner 
and outer expansion and the condition for y3 = 0 is given by the coupling 
conditions (3.6), (3.7). The height function h = hy is also expanded into a 
series, 


hy(t,x) = n ho(t, x, x/n) + n hi(t, x, x/n) +. (3.13) 


Using these formal expansions in (3.2)—(3.8) it is possible to derive a two-scale 
model that consists of the following relations: 


e Macroscopic equations for fluid flow and solute diffusion in Q: 


Ach + vo: Vey —DyAcy =0, (3.14) 
V-vo=0, 

an (3.15) 

OLVo + (vo 7 V)vo m nAvo + Vpo =0. 


e A microscopic BCF-model in Y to be solved for every x € So: 


1 1 
cS — DL Aycs = =o — —c§ fory €Y, (3.16) 
Ty TS 
cp = (1+ KoM2y0/(kBT)), 
wry = DY. ES for y € Ip(t,x). (3.17) 
On 
e Coupling conditions on So 
Och TS cy 
Dy =m | 2-2 1 
ve = mt (2-3), (3.18) 
v=0 (3.19) 


with the microscopic mean value T8 (t,x) = Jy (t,x, y) dy. These equa- 
tions serve as boundary conditions for (3.14), (3.15) on So. 
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The details of the derivation can be found in [16]. The equations for fluid flow 
here decouple from the other equations. The fluid velocity can be computed 
in a pre-processing step, and the main computation consists in solving the 
remaining equations with given fluid velocity. 


3.3 Analysis of the two-scale model 


An analysis is available for a slightly modified version of the two-scale model, 
where the sharp interface version (3.16), (3.17) of the BCF-model is replaced 
by a the following adaption of the phase field model (2.19), (2.20) to liquid- 
phase epitaxy: 
Y g 
d:c — Dg Ayc§ + (22)*Ado = g S- A, (3.20) 
Ty TS 


aoegðtpo — Ez Aypo + G'(d0) + poles, po) = 0. (3.21) 


The function po here represents a possibly generalized version of the term 
Shes (c — c*) in a rescaled variant of (2.20). The lower indices 0 indicate 
solutions of the homogenized problem. A basic assumption for the analysis is 
that G” and p are Lipschitz functions with at most linear growth, |G’(¢)| < 
C(1 + ||) and po(e,¢) < C(1 + |e| + |¢|). For the modified model (3.14), 
(3.15), (3.20), (3.21), (3.18), (3.19), the existence and uniqueness of its solution 
can be proved, see [16], Theorem 1. The main tool of the existence proof is 
the Schauder fixed point theorem, applied to a composition of the solution 
operators for the family of microscopic problems on one hand and for the 
macroscopic problem on the other hand. 

For the phase field versions of both the original and the two-scale model, 
an estimate of the model error is avaliable. The original model of scale 7 is 
given by the equations (3.2), (3.3), (3.6)-(3.8) and the phase field version of 
the BCF-model 





V S 
de® — Ds AcE + 7ta = — - =, (3.22) 
TV TS 


ae0,b — Ao + G'(o) + plci, p) = 0. (3.23) 








The scaling of the phase field variables £ and a is € = nép and a = 7~7€9, the 
relation between p and po is 


p(c, o) _ po(n’e, o). 


This scaling property is in harmony with the scaling of c’ in (3.11). 

Let (Vn, Pn: CK , CS, Øn) be the solution of the described original problem 
for scale 7 and let (vo, po,c¥, c$, po) be the solution of the described two-scale 
model. Due to the scaling relations (3.11), (3.12) the comparison of models is 
done with the rescaled functions 
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ad = Hep, and E = if oe 
In order to compare the functions es , dn of variables (t,x) and the functions 
cS, bo of variables (t, x, y) we employ macroscopic reconstructions of two-scale 
type 

u(t,x) = uo(t,x,x/n) for u = ce Q. 


Let Q,,(t) be the liquid domain and S,,(t) be the liquid-solid interface for the 
problem of scale 7. The boundary conditions for both the original model of 
scale 7 and the two-scale model on 0Q,,(t) \ S(t) are Neumann conditions for 
ae cy and Dirichlet conditions for v}, vo; the given boundary data for both 
models are identical. Further boundary conditions are homogeneous Neumann 
conditions for cf , dn on OSo, for cS and ¢g there are no boundary conditions 
on So, because there are no x-derivatives in the corresponding differential 
equations. The initial conditions for both models are compatible in the sense 
éy (0,x) = cf (0, x), Vn(0,x) = vo(0,x) and 


ČS (0, x) = c§(0,x,x/n), n(0,x) = ¢0(0, x, x/n). 


Then under some moderate additional assumptions on the regularity of the 
solutions the following estimate for the model error is availabe, see [16]: 


lez 


mT c” lle cr.ta(so)) i Pn 


n 
— £5 ll e02.(7;20(50)) 
V o V 
+ sup (E -DOl +I) 824) 
< Cr/?. 
The presented estimate reveals an approximation order of nl ? that is lower 
than the order 7 as could be expected from the asymptotic expansion. This 
deterioration has two reasons. First, the macroscopic fluid flow and convection- 
diffusion processes happen in the whole domain Q for the two-scale model and 
in the liquid domain Q,,(t) for the original model. These domains differ by 
a volume of size proportional to 7, which leads to an error of order 7!/? in 
L-type estimates. Second, in a general setting it must be expected that the 
microstructure evolving at the boundary of So does not occupy a full shifted 
periodicity cell 7 Y but some “truncated” domain; this generates an additional 
error of order 7!/? in the L2(So)-norm. 


3.4 Numerical approach 


The scheme we employ for the simulations in the following section is based on 
the method of lines in time. After discretizing the problem by finite differences 
in space, we solve the resulting ODE system by the standard Runge-Kutta 
method of fourth order with fixed time step and impose a constant velocity 
field defined as 
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Vijk = (voy/anY0,0) 5 


where the ijk denote the three spatial directions and hY the mesh size in 
the z direction in the volume. Moreover, in the following simulations we have 
constant inflow at the left hand side of the simulated region and free outflow 
at the right hand side. 


3.5 Simulation results 


First simulations allow to visualize e.g. how the variation of the macroscopic 
flow field as depicted in Fig. 3.2 is accompanied by a variation of the con- 
centration field in the melt which in turn results in a systematic variation of 
microstructure evolution as depicted in Fig. 3.3. The parameters for the sim- 
ulations are given in Table 3.1. Systematic parameter studies to show the lim- 
itations of previous perturbative studies [24, 12] of microstructure evolution 
in LPE are still open, just as their analysis to extract relations between pro- 
cessing parameters and morphological stability of the material sample which 
promise to allow for a desired more efficient parameter control in the sensitive 
crystallization process of LPE grown material systems [13]. 


Macroscopic concentration and velocity 





Concentration 
0.976 





0.698 


Fig. 3.2. Dynamics at the macroscale for the parameters given in Table 3.1. 


Table 3.1. Simulation parameters for simulations depicted in Fig. 3.2 and 3.3 


a} 1 A1 }19.591 Ts| 5 2a | 2 Ma 0.009 
é| 1 Ds| 10 Tv | 2 Dy}10 vo | 0.1 
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Epitaxial surface Surface height 
29.6 


26.2 


19.4 





16.0 


Fig. 3.3. Dynamics at the microscale for the parameters given in Table 3.1. 


4 Application in Molecular Beam Epitaxy 


Here we will consider Molecular Beam Epitaxy (MBE), which allows a decou- 
pling from macroscopic processes in the growth furnace and the microscopic 
description on the surface of the growing film. Thus the model in Sect. 2 is 
directly applicable. 


4.1 Phase-field approximation 


Now we are concerned with the phase-field model (2.19)—(2.20). Measuring 
the length in atomic distances (i.e. Ra = 1) the equations read 


Ow” + Onc =V. (DrVc) + F (4.1) 
ae’ p = L (2Ag — G'(¢)) + elc — č). (4.2) 
kBT 


We consider the initial conditions 
(x, 0) = do(x), c(x,0) = č, 


where the function ġo introduced in Sect. 2.4 represents a screw dislocation 
and c* is the equilibrium concentration for straight steps. Furthermore we use 
no-flux conditions on the boundary of the domain N = (—L/2, L/2)? 


Onc = Ond = 0. 


4.2 Numerical approach 


Following the numerical approach for a viscous Cahn-Hilliard equation de- 
scribed in [40] equations (2.19)—(2.20) are discretized in space by linear finite 
elements and in time in a semi-implicit way. Here the derivative of the double 
well G’(@) is treated explicity. The resulting linear equations are solved as a 
system. 
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4.3 Simulation results 


The algorithm is implemented in the adaptive finite element toolbox AMDiS 
[50]. In the simulations the following parameters are used: Dr = 10.0, 
c = 0.1, or = 1.0 and a = 1.0. Fig. 4.1 shows the phase-field variable 
and adatom density at a given time instant after the growth of several mono- 
layers. A more quantitative picture is given in Fig. 4.2, which shows the devel- 
opment towards a stationary profile. For t = 1500 the width of the terraces, 
the stepspacing l, is constant. We can now investigate how the stepspacing l 
depends on the deposition flux F. Before we analyse this numerically let us 
consider a theoretical approach used in [23]. The stepspacing l can be com- 
puted by solving the cubic equation 


e 3 F 14Drc*ya 
ie A ee 
jar.) Abe ( l keT 


A comparison of the numerical results with this theory is given in Table 4.1. A 
second interesting quantity is the surface width w, which is a common measure 





Fig. 4.1. Phase-field variable ¢ (left) and adatom concentration c (right) at a given 
time instant. 





Fig. 4.2. Phase-field variable ¢ at different time steps t = 0, t = 100, t = 300 
and t = 1500. The deposition flux used is F = 0.2. One only sees the height profile 
measured from the minimal terrace height. 
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Table 4.1. Comparison of theoretically and numerically obtained step spacing in 


Foose 


= 6.1/8.0}10. 13.3 
[simulation 6.6/8.5] 11. 15.0 


stationary profile. 








FEFE 
MON 

















0.04 0.06 0.08 0.1 
Ft/(LA)® 


Fig. 4.3. Rescaled surface width wt as a function of rescaled time ee for various 
values of L. 


of film roughness and is defined as [23] 


1/1 a He 


with the mean value ¢ = |Q|~ fo ¢ dx. In the limit t — œ one obtains 


. L 
oe) DA 
Fig. 4.3 shows the data collaps by rescaling the surface width and time using 
this ansatz. Thus the time needed to reach the steady state solution scales as 
(L/l)°. 
A similar approach with the incorporation of the kinetic boundary condi- 
tions (2.4), (2.5) is under investigation [26]. 
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1 Introduction 


The aim of this work is to study the sharp interface limit of the Cahn-Hilliard 
equation in situations in which elastic stresses appear. The Cahn-Hilliard 
equation is a phase field model in the sense that interfaces are diffuse, i.e. 
across an interface an order parameter representing the phases changes its 
state rapidly, but in a smooth way. If elastic stresses are present, the Cahn- 
Hilliard equation has to be coupled to an elasticity system and this extended 
set of equations is called the Cahn-Larché system. For the Cahn-Hilliard equa- 
tion it is well known that if the interfacial thickness £ > 0 tends to zero, the 
Mullins-Sekerka model is recovered. The Mullins-Sekerka model is a sharp in- 
terface model and can be formulated as a classical free boundary model. Also 
the sharp interface model can be extended to include elastic effects and it is the 
goal of this paper to discuss recent attempts to relate the Cahn-Larché system 
and the elastically modified Mullins-Sekerka model. We refer to the article by 
Garcke et al. [GL*06] in this book for more information on phase separation 
and Ostwald ripening which are both phenomena that can be modelled with 
the help of the Cahn-Larché system and the extended Mullins-Sekerka model. 
We also refer to [GL*06] for a discussion of situations where the two models 
can be reasonably used to recover the above phenomena. 

Some work has been done already to study the sharp interface limit of 
the Cahn-Larché system. Fried and Gurtin [FG94] and Leo, Lowengrub and 
Jou [LLJ98] used the method of formally matched asymptotic expansions to 
relate the two models. Using this technique one has to assume that a smooth 
solution of the sharp interface model exists and fulfills certain smoothness 
properties, but to our knowledge there are no rigorous results known so far 
for the asymptotic limit of the Cahn-Larché system. 

We present three results which relate the Cahn-Larché model to the sharp 
interface model. In Sect. 4 we will first show that the Cahn-Larché free en- 
ergy, which is a Ginzburg-Landau type energy supplemented by contributions 
from elasticity, has a I -limit for € tending to zero. The -limit contains the 
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classical surface energy together with elastic terms. Furthermore we will show 
in Section 4 that one can pass to the limit in the Euler-Lagrange equation for 
minimizers of the Cahn-Larché energy in order to obtain an elastically mod- 
ified Gibbs-Thomson equation. This result generalizes a result of Luckhaus 
and Modica [LM89] to the Cahn-Larché system. 

For general solutions we are going to use arguments and techniques from 
geometric measure theory to get rigorous results. Here one uses a priori esti- 
mates and compactness arguments to show convergence of the concentration, 
the chemical potential and the deformation vector. The main part is then to 
derive the Gibbs-Thomson law from the Cahn-Larché system. We are going to 
use methods introduced by Ilmanen [Ilm93], Soner [Son95] and Chen [Che96] 
in order to perform the limiting process in the context of the theory of vari- 
folds. The analysis for the Cahn-Larché system is more complicated due to 
the fact that elastic terms appear in the Gibbs-Thomson equation through 
the so-called Eshelby tensor. 

The outline of this work is as follows. After introducing the governing 
models in Sect. 2 we review basic knowledge on geometric measure theory 
and present related work in Sect. 3. In Sect. 4 we consider the stationary case 
and in Sect. 5 we discuss the general case in the context of geometric measure 
theory. Sect. 5 is part of the ongoing PhD thesis of the second author and we 
refer to the thesis [Kwa06] for more details. 


2 The models 


We start reviewing elasticity theory and the models which our analysis is 
based on. 


2.1 Introduction to mechanics 


We shortly introduce the basic concepts of linear elasticity, for a detailed intro- 
duction we refer to [Gur72], [Cia88] and [Bra91]. Denoting by a bounded region 
N C R” the reference state, we introduce the deformation vector u: 2 — R”. 
Since in the applications we have in mind only small deformations appear, we 
consider a theory which is based on the linearized strain tensor 


E(u) = 1/2(Vu + Vu’). 
The elastic energy density W is typically of quadratic form 
W(c,€) = $(€ — E*(c)) : CIA (E — E*(0)) (2.1) 


with a symmetric and positive definite elasticity tensor C(c). We call E*(c) = 
E*c the eigenstrain corresponding to c which describes the energetically fa- 
vorable strain at concentration c. If C(c) = C does not depend on the con- 
centration, we speak of homogeneous elasticity, otherwise we use the term 
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inhomogeneous elasticity. For the theory we are going to present in this work 
we will make the assumption that for a suitable constant C > 0 the following 
properties of W hold 
W e CH(RxR”*” R) such that 
IW (c, £)| < C(1 + lel? + |El’), 
|W elc, E)| < C + |e]? + |El), 
|W.-(c,€)| < C(1 + [e] + JEJ). (2.2) 








We assume in addition that W(c,€) only depends on the symmetric part 
of € € R"*” and W e is strongly monotone, i.e. there exists a constant cı > 0 
such that 


(W.e(c,€1) — Wee(e, €2)) : (E2 — E1) > aal€o — El’. (2.3) 


We remark that an elasticity energy W according to equation (2.1) with 
E*(c) = E*c does not fulfill (2.2), if the elasticity tensor C depends on the 
concentration c. 

The mechanical equilibrium is attained on a much faster time scale com- 
pared to concentration changing by diffusion. This is why we assume that 
the mechanical equilibrium is attained instantaneously, so that the equation 
for the mechanics (2.4) does not involve any time derivatives and we hence 
consider at each time t > 0 the quasi-stationary system: 


div S = div W e (c, E(u)) = 0 (2.4) 


where S = S(c,E) = W e(c, E) is the stress tensor. 
For definiteness we demand the deformation vector u to be in K with 


Xira := {u € H? (Q, R”) | there exist b € R” and a skew symmetric 
A € R””” such that u(x) = b + Ax} 


and Ka is the space perpendicular to Xi;q where perpendicular is meant with 
respect to the H!?-inner product. We remark that the energies of both the 
phase field and sharp interface models depend on u only through €(u) and 
hence the infinitesimal rigid part of u has no influence on the evolution of c. 
We have the Korn inequality 


lulla) < ČIE) Ilzo) 


for all u € Xi for some constant C (see Zeidler [Zei88]). In particular we ob- 
tain using (2.3) and an energy argument that u € X;,q is uniquely determined 
by (2.4) and a stress-free boundary condition. 
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2.2 Phase field model 


The Cahn-Larché model is based on the Ginzburg-Landau energy 


E 1 
E$ (c, u) = f (Sve? + aL) + W(e.£(u)) (2.5) 
where £ > 0 is a small parameter related to the thickness of the diffuse in- 


terface, c is a scaled concentration difference, Y is a polynomial double well 
potential which we take to be 


W(c) = Le — 1)’. (2.6) 


In the diffuse interface model the evolution problem related to (2.5) is the 
Cahn-Larché system 





Oc = Aw, (2.7) 
ES; 
S P= —e4Ac + 40'(c) + W,-(c, E(u)), (2.8) 
c 


E 


6E 
div S = div —2! 
ou 





=0 (2.9) 


where w is the chemical potential. We can view this system as the H~! gra- 
dient flow of the energy functional (2.5), see [GL*06]. This structure will lead 
to crucial energy estimates of the Cahn-Larché system. The existence of solu- 
tions to this phase field system has been shown in [Gar00] and [Gar03]. The 
results are cited in Subsection 3.2. 


2.3 Sharp interface model 


The energy for the sharp interface limit is given by 


Bu= f adh" Y | WelE(u)) de (2.10) 
r ka+,— “2k 


where o > 0 is a surface energy constant and I is the interface (a hyper- 
surface). The notation f, .dH"~' denotes the integration with respect to the 
(n — 1)-dimensional surface measure (the Hausdorff measure) and 2_, 24 are 
the distinct regions occupied by the two phases with the corresponding elastic 
energy densities 


W_(€):=W(-1,€), Wy := W(+1,€). 


To simplify notation we set W} = 0 in 2_ and vice versa, since then we can 
write )°, Jo, Wk = fo Xr Wr. Furthermore the surface energy density is 
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o=}| GEOP HECU) 
where z is the solution of 
—2" +W'(z)=0 with z(—oo) = —1 and z(oo) = 1. 


One can easily compute 


o= f 2V f VOPR dy. 


The evolution problem related to the sharp interface energy is a modified 
Mullins-Sekerka problem 


Aw =0 in Q_(t) and Q(t), (2.11) 

V =-3[Vu]t -v on T(t), (2.12) 

w = øk + $v" [WId — (Vu)? S]tv on I(t), (2.13) 

div S = 0 in Q_(t) and Q(t), (2.14) 
[Sut = [ult =0, fut =o on T(t) 


where N_(t) and 92,(t) are the regions occupied by the phases at time t, 
I(t) is the interface separating these regions, v is the unit normal along the 
interface pointing towards 2,, V is the normal velocity of the interface and 
[.]* denotes the jump of the quantity in the brackets across the interface, e.g. 
[w}* = wt—w-. «is the mean curvature of I (t) with the sign convention that 
k is positive, if I(t) is curved in the direction of v. In contrast to its standard 
definition the mean curvature is taken here to be the sum of the principle 
curvatures. The first two equations are classical laws describing quasi-static 
diffusion driven by a chemical potential w. The third equation is the modified 
Gibbs-Thomson equation stating that the system is in local thermodynamical 
equilibrium. 

Since we want to restrict our analysis to closed systems, we take homoge- 
neous Neumann boundary conditions. In the phase field model this means 


Ve-vg=Vw-vg=0, Svg=0)0, 
where vp denotes the outer unit normal of 2. In the sharp interface model the 


condition for the concentration changes to an angle condition for the interface, 
so altogether the boundary conditions for the sharp interface model are 


L(T (t), 3N) =90°, Vw-ve=0, Svg=0. 


3 Preliminaries 


We introduce notations and recall some known facts about measures and var- 
ifolds (see also [EG92], [Fed69] and [Sim83]). We end this section by precisely 
stating the problems we want to analyze in this paper and with a discussion 
of related work. 
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3.1 Geometric measure theory 


First we recall the definition of a Radon measure js on an open set 2 C R” 
as a Borel regular measure that is finite on compact sets. To a measure u we 
introduce the notion of densities on 2 for x € Q 


QAB 
Ora ii, x) = lim sup HAEE) M ola) 
p—0 Wn—1P 


027} (u, x) = lim inf u(20 B(x) 
* ’ pan . 


Es 


Wn—1p"— 1 


Here wn—1 is the volume of the (n — 1)-dimensional unit ball. If 6*"~1(, £) 
and 6”~1(,x) coincide, this common value will be denoted by 0”~1(, £). 
Now we look on the set of (n — 1)-dimensional subspaces 





P"—! := {P| P is a (n—1)-dimensional subspace in R”} = S"~*/{+1}. 


We will use the same notation P for the orthogonal projection onto the sub- 
space P. On P”~! we use the metric induced by endomorphisms: 


This enables us to define a varifold: 


Definition 3.1. A varifold V is a Radon measure on the Grassmanian 
G(2):= 2 x pr-t. 


Remark 3.2. 


e Such varifolds are in fact (n — 1)-varifolds. We use such varifolds, since we 
want to describe interfaces. One can see them to give spatial and tangential 
information independently of each other. 

Defining varifolds simply as Radon measures on 2 x P”—!, we have weak- 
ened the usual view that the tangential information is solely given by the 
spatial information (of a neighborhood). 

e Fora C!?-hypersurface M, we can introduce a corresponding varifold V by 
setting 

dV (x, P) = dH”! | (2) ôr, m(P). 


Finally we introduce the mass measure of a varifold. 


Definition 3.3. The mass measure of a varifold is defined by 


py (A) := i dV (x,P) for ACQ. 
Axpnr-1 
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Fig. 3.1. An example where phase field interfaces lead to a varifold in the sharp 
interface limit. 


The motivation to use varifolds is that the limiting interface will not pro- 
vide sufficient smoothness to fulfill some kind of Gibbs-Thomson law in the 
classical sharp interface sense. In fact Schatzle has shown in [Sch97] that even 
the BV-formulation of the Gibbs-Thomson law breaks down when two inter- 
faces touch each other. Introducing the notion of varifolds enable us to come 
up with a formulation which extends the model beyond the time of topological 
changes. 


Remark 3.4. Bronsard and Stoth studied the related Allen-Cahn equation and 
proved that in the limit there exist interfaces with arbitrary high multiplicity, 
also called phantom interfaces, see [BS96]. Figure 3.1 gives an illustration of 
a time-independent example. Assume that the two regions of approximations 
x* merge to one when letting € — 0. Then the dashed line is a phantom inter- 
face. Such phantom interfaces are not captured when using only characteristic 
functions. 


First Variation of a varifold 


In the smooth classical sense the Gibbs-Thomson law incorporates the mean 
curvature «. Actually the curvature term occurs through the first variation 
of the area. For varifolds one has to use the first variation formula derived in 
Allard [A1172] and Simon [Sim83}. 

As it can be found in the aforementioned works of Allard and Simon, the 
first variation of a varifold is given by 


ôV(X) = DX(a)-PdV(x,P) for X € Co(2,R”) (3.1) 
G(2) 
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where DX (x) - P is defined to be the inner product between linear mappings 
and DX (a): P turns out to be the divergence of X with respect to the linear 
subspace P. 

In fact, this coincides with the mean curvature in the smooth case. Using 
the Gauss theorem on a C?-hypersurface M 


f X-vmsman = f div m X dH”! 
M M 


with vm an arbitrary unit normal to M and km the mean curvature of M 
with the sign according to vm. One notices that for X € Cj(2,R”) the 
variation of the area can thus be read as the surface divergence of the vector 
field, i.e. the full divergence minus the normal part of DX. 

In the case that the varifold is less smooth, but still has locally bounded 
first variation, one gets the following decomposition: 

If ||V || is a Radon measure, i.e. 





YK CC QAcK>0: |6V(X)| <cK||X|lo VX € Ch(K,R"), 


the first variation of V can be seen as a bounded operator on C°(2,R") and 
one has a ||6V||-measurable function v: R — P"~! such that 


5V(X) = -f X -vdljôv|. 
Q 
We now take the Lebesgue decomposition of ||SV|| with respect to py: 


V) = f x-vajvy)= | X-Hvdw+ f X-vdo (3.2) 


where Hy is the Radon-Nikodym derivative of ||6V|| with respect to uy mul- 
tiplied with the normal function v: 


Hy (2) = v(x) Dyy ||6V||(2). 


Hy is called generalized mean curvature vector. The set of singularities 
Z := {x € R” | D,,||6V||(x) = co} is the generalized boundary of V with gen- 
eralized boundary measure o, generalized unit co-normal v|z and uy (Z) = 0. 


Rectifiability 


For a (n — 1)-rectifiable set M C N there exists for H"~1-a.e.  € M the 
approximate tangent plane to M, denoted by T2PP M (see [Sim83] for details). 
To such a set M one can associate a varifold Vm by setting 


Vu (A) := H+ ({x € 2 | (x, TPP M) € A}) for AC G(M). 
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Definition 3.5. A varifold V is rectifiable, if there exist 0; > 0 and (n — 1)- 
rectifiable sets M; C 2 fori e N such that 


V =Y 0i Vm 


iEN 
Since varifolds represent an abstract concept, one goal is to confirm recti- 
fiability, if not even integrality, which is the case when all 6; are integers in 
the above identity. 


The relation between rectifiability and the first variation is stated in the 
following theorem by Allard (see [A1172]). 


Theorem 3.6 (Allard). Suppose a varifold V has locally bounded first vari- 
ation in Q and 0"—*(y,2) > 0 for py-ae. x E€ N, then V is already a 
rectifiable varifold. 


Remark 3.7. Especially for a varifold V with locally bounded first variation 
in Q the restriction of V onto {x | 6*”~1(uy,x) > 0} x P” is rectifiable. 


The next theorem by Schatzle shows that the structure of the first variation 
can lead to the desired rectifiability (see [Sch01)). 


Theorem 3.8 (Schatzle). Let W be a varifold in Q C R”, w € H'?(Q), 
n/2<p<n, FC R such that the characteristic function xr lies in BV(Q2). 
Furthermore we suppose 

1. W (n) = fo div(wn)xr Vn € Cà(2, R”), 

2. |Vxr| < uw and 

3. ||w|| aneco + uw(2) < A for some AER. 


Then W is rectifiable and has locally bounded first variation satisfying 


|| Hw | 





Ls (uw LB(xo,r)) Š Cn, plr) A tS VB(zo, 2r) G 2, 


where s E€ R such that nat = z~ 1. 


The main part of the proof is to show a particular monotonicity formula 
for the density of the mass measure: 


Lemma 3.9 (Monotonicity Formula). For a varifold W which fulfills the 
assumptions of Theorem 3.8 the function 


p= p ™ "Y uw(Bpl(z0)) + Crp min(1, d) tA% YzroE N,0<p<d 
is non-decreasing for a = 1 — + € (0,1) with d = dist(xo, 02). 


Once this monotonicity formula is verified, one can use the following the- 
orem by Ziemer. 
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Theorem 3.10 (Ziemer). Let uw be a Radon measure on R”. Then the fol- 
lowing statements are equivalent: 


1. H"~'(A) = 0 implies that (A) = 0 for all Borel sets A C R” and there 
is a constant C such that IJ ody| < Clldllaver) jor all o € BV(R”). 
2. There is a constant C such that p(B(x,r)) < Cyr ; 


By the theorem of Ziemer we obtain from Lemma 3.9 local bounds for the 
measure uw, i.e. for all 6 € BV(Q) and Bp(zo) C 2 


| i 6 x2, ander Zi alive. 


Now, we choose ¢ = |w|*, which is in H':1(Q) by imbedding theorems, and 
the first variation of the varifold W can be therefore estimated by 








Ls* (uw) 


Wn) < | i; (umdu! < 


By this estimate the first variation can be interpreted as a Radon measure and 
the above inequality leads to rectifiability of the varifold through the theorem 
of Allard. 


3.2 Assumptions and notations 


We start with solutions of the Cahn-Larché systems fulfilling the following 
assumptions (see also [Gar00] and [Gar03]) for 2 C R” open and bounded 
with smooth boundary. We consider for all € > 0 

c € Liocl0, 00; H??(2)) N Aygo (0, 00; H7? (2)), 

wF € L? (0,00; H»?(2)), 

uf € Li,,(0, 00; H*(2)") 


such that the following weak formulation is fulfilled for all T > 0 


T T 
I CERAT E | | Vu se dads (3.3) 
0 OJR 
T T 
| if wo dxdt = ij | EVE -Vea + 2W'(C*) Ca +W.-(c,u’)C2 dxdt, (3.4) 
OJR 0472 


o= f f S: Dede (3.5) 


for all & € L(0,T; H'(Q)), ĉe € L(0,T; H120) N LO(R x J0, T]) and 
¢3 € L?(0, T; H'?(Q)"). Here, the notation of £ € L? (0, 00; H+? (2)) means 
that for all times T > 0 one has c® € L?(0, T; H'?(Q)) and (.,.) is the duality 
pairing between H~'?(Q) and H'?(Q). In contrast to other notations we 
define H~'?(Q) as the dual of {c € H+?(Q) | fg e= 0}. 
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As initial conditions we assume that for all € > 0 


1. the initial energy is bounded: Ef-(0) < Eo and 
2. the integral of the initial concentration does not depend on ¢, i.e. there 
exists a constant mo € (—1,1) such that fo c§ = mo| 2]. 


Remark 3.11. The existence of weak solutions of the Cahn-Larché system has 
been shown in [Gar00] and [Gar03]. But so far it has not been verified in 
general, if the concentration and deformation vector are indeed in H??() 
for almost all t. 

In the case that W is of the quadratic form (2.1) with constant elasticity 
tensor C, i.e. in the homogeneous case, the equation determining u can be read 
as an elliptic system with constant coefficients, where only the right-hand side 
depends on the concentration: 


divS=0 <= > div(CE(u‘)| = div[(CE*(c*)]. 


Since c® is in H':?(Q), the right-hand side is in L?(), which leads u£ to be 
in H?-?(Q) by elliptic regularity theory. 

On the other side, W e = €* : C[E(u®) — E*c*] is in L?(2). Now, equation 
(3.4) can be read as an elliptic equation for cë and again elliptic regularity 
theory can be used. 

If one considers inhomogeneous elasticity, the elasticity system (2.4) con- 
tains possibly non-continuous coefficients C (cf) so that one cannot argue as in 
the homogeneous case. Though, in low dimensions due to Sobolev imbedding 
theorems the concentration functions cê are continuous and therefore elliptic 
regularity theory for smooth coefficients can be used. 

Nevertheless, for the general case we are presenting in this work we have 
to include this assumptions in order to get the correct Gibbs-Thomson law, 
see Sect. 5.4. 


One important first observation for the limiting process € — 0 is to identify 
e*(c*) = SVE]? + 4W(c*) 


as the interfacial energy density in the phase field model. Heuristically, this is 
exactly the quantity one observes to carry the interfacial energy of the phase 
field model, and the goal is to show convergence to a quantity that will be 
understood up to a factor as the H”~!-measure of the interface. 

The second important function is the so-called discrepancy measure 


E (ce) = Eve]? — ip(e). (3.6) 


As it is stated in Theorem 5.7, in the limit € — 0 the discrepancy measure will 
be non-positive, which means that the Y-part is larger than the |Vc*|?-part. 
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3.3 Related works 


One important source of this work is the paper by Chen [Che96]. He has 
studied the asymptotic limit of the Cahn-Hilliard model. Chen showed for ar- 
bitrary spatial dimensions that solutions of the Cahn-Hilliard system converge 
globally in time to some generalized sharp-interface solution. He did not show 
that the limit varifold is rectifiable, but in the case p = 2,n = 3 one can use 
the Theorem 3.8 by Schatzle to deduce rectifiability for the limit varifold for 
the Cahn-Hilliard systems without elasticity, see Remark 5.3. 

There is one significant difference to results for the related Allen-Cahn 
models which are proposed to describe motion of phase boundaries driven by 
surface tension: 


0 
°F = eAc—e'W'(c). 
Ilmanen [Ilm93] has studied the limiting behavior of the Allen-Cahn equa- 
tion towards the mean curvature flow in the sense of Brakke [Bra78] and 


confirmed that one gets in the limit 


€=0. 


This is also known as equipartition of energy. It is quite interesting to note that 
the interface energy is asymptotically equally distributed between the |Vc*|?- 
and the ¥(c*)-part. Moreover this result can be used for further results, namely 
it is easier to deduce the fact that the resulting interface varifold is rectifiable. 

After Ilmanen [Ilm93] first used geometric measure theory to prove such 
convergence in N = R”, Soner [Son95] improved the result for more general 
settings. Hutchinson and Tonegawa studied in [HT00] the asymptotic behavior 
of critical, not necessarily minimal points of the Cahn-Hilliard energy func- 
tional. In their work they also used geometric measure theory and derived local 
estimates for the discrepancy measure (3.6). By that, they gained convergence 
results for bounded domains. In their (time-independent) setting the limit var- 
ifold turns out to be integral, i.e. the interface has indeed integer multiplicity 
modulo a surface constant almost everywhere. Moreover local minimizers of 
the Cahn-Hilliard energy functional converge to a local area minimizer subject 
to a volume constraint. Later Tonegawa extended with similar estimates the 
results by Ilmanen and showed that time-dependent solutions of the Allen- 
Cahn equation converge to an integral varifold, cf. [Ton03]. 


4 The stationary case 


Before we study the evolution problem, we consider the sharp interface limit 
of the Ginzburg-Landau energy Ef,(c,u) in the limit e tending to zero. As 
the Cahn-Larché system conserves the integral of the concentration c, we will 
consider Ef; subject to an integral constraint on c. In fact in this case one 


Pp 


can show that Esi is the T-limit of Eps even if we take the constraint into 
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account. Furthermore, we present a result stating that the Lagrange multipli- 
ers related to minimizers of Ex, will converge to a Lagrange multiplier related 
to a minimizer of Es; subject to a volume constraint. The results we present 
will generalize results of Modica [Mod87] and Luckhaus and Modica [LM89] 
to the case including elastic effects. 


4.1 The T-limit of the Cahn-Larché energy 


In this subsection we study solutions of the variational problems: 


(P*) Find a minimizer (c,u) € H'?(Q) x Xira of Ep subject to the 
constraint Ta Jo c = mo, where mo € (—1,1) is a given constant. 


We will now present a result stating that solutions to (P€) converge along 
subsequences to a minimizer of the functional 


B®: LEQ) x Xt, > RU {oo} 
where 


2H"! (fe =1}N N) + Ja W(c,E(u)) if ce BV(Q,{—1,1}) 
E?(c,u) = and T Joe = mo, 


oo otherwise. 


The following theorem now states that E? is the T-limit of Ey. We also 
obtain that minimizers of Ey. approximate minimizers of Esi, if we take a vol- 
ume constraint into account. The limiting variational problem is a partitioning 
problem taking interfacial energy and elastic effects into account. 

The following theorem has been shown in [Gar00]. 


Theorem 4.1. Assume that the assumptions of YW and W as stated above hold 
and let N) be a bounded domain with Lipschitz boundary. Then it holds: 


1. For all (&*, ue*) zen E€ H? (Q) x Xiq with & —> c in L'(Q) and u®™ > u 
in L?(Q, R”) as eg tends to zero, it holds 


E°(c,u) < lim inf E(u). 

2. For any (c,u) € L'(Q) x X~4 and any sequence cep — 0,k € N, there 
exists a sequence (c&,uc*)pen € H?N) x Xt with c&* > c in L'N) 
and ut — u in L?(2,R") as ek — 0 such that 


E? (c, u) > limsup Epp (c, u). 


k—0o 
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3. Let (c&,u®) be solutions of problem (P€). Then there exists a sequence 
Ek + 0,k E€ N and (c,u) € L'(Q) x Xzq such that 


cr 


>c in L'(2), 
u% >u in Ht ?(2,R") 


and (c,u) is a global minimizer of E°. 


For the proof and for a generalization to the situation of more than two phases 
we refer to [Gar00]. 


4.2 Convergence of the Lagrange multipliers 


For a minimizer (c, u) of E° it can be shown that a constant Lagrange multi- 
plier w exists such that 


2on+v-[WId— (Vu)? Wee] v = 2w. (4.1) 


A minimizer of E? minimizes Esi subject to a volume constraint and w is the 
Lagrange multiplier related to this constraint. 

In the case that no elastic effects are present, we obtain that the mean 
curvature is constant and the term v- |WId — (Vu) W e] v modifies this law. 
In particular the mean curvature can be inhomogeneous along the interface. 
The identity (4.1) and its non-equilibrium analogue (2.13) can be interpreted 
as a generalized Gibbs-Thomson equation. 

Absolute minimizers (cf, u) of Ev, have a constant Lagrange wê which 
fulfills in a distributional sense (see [Gar00]) 


-eA + tW (E) + Wel, E(u’) = o. (4.2) 


In [Gar00] it is shown that the Lagrange multipliers of (c°, u®) converge 
(along subsequences) to a Lagrange multiplier w of the sharp interface varia- 
tional problem. Here we state the result in detail. 


Theorem 4.2. Let Q be a domain with a C1-boundary and assume that V and 
W fulfill the conditions stated above. Furthermore let (E, u°) € H? (Q) x Xj 
be a solution of the variational problem (P®) with Lagrange multipliers w*. 


Then for each sequence (Ek)ken — 0 such that 
Ek 


Erke in LQ), 
u >u in H?(Q,R”) 


it holds 


Ek 


W > W 


>) 


where w is the Lagrange multiplier for the absolute minimizer (c,u) of E°, 
compare (4.1). 
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For a proof we refer to [Gar00]. We remark that although the method of 
Luckhaus and Modica [LM89] for the case without elasticity is used in the 
proof, one cannot follow their arguments in a straightforward way. This is due 
to the fact that not enough regularity is known for the minimizer (cf, u£) of 
Eŝs In the proof of Theorem 4.2 one uses variations of Ej, with respect to 
the independent variables and shows that the resulting Lagrange multiplier is 
related to the Lagrange multiplier from (4.2), which is the first variation with 
respect to the dependent variables. 


Remark 4.3. We also note that a minimizer (c, u) of E? also fulfills 
| do ERP VEN Veal +f (Wid Wawa ves if Acv- é 
Q Q R 


for all £ € C°(2,R”) with €- vo = 0 on OM. Here, v = -iHe is the 


generalized outer unit normal to {c = —1} which is a |Vx{e=-1} -measurable 
function. The above identity is a weak formulation of the modified Gibbs- 
Thomson equation (4.1) (see [Gar00]). 


5 The time-dependent case 


For the evolutionary system we start with a suitable weak formulation of the 
sharp interface problem. Through the limiting process one cannot expect that 
the resulting limit objects are smooth enough such that equations (2.12)— 
(2.14) can be verified in a classical way. Besides concentration, the chemical 
potential and deformation vector which converge quite straightforward in the 
limiting process, to formulate a Gibbs-Thomson law we need both a charac- 
teristic function and a varifold, which represents the interface as motivated in 
Sect. 3.1 including possible phantom interfaces. 

After stating the theorem we give an overview on the proof, not giving all 
the details due to the limited space and refer to [Kwa06] for a full treatment. 


5.1 Statement of the main theorem 


First we specify the notion of a generalized solution of the sharp interface 
model. 


Definition 5.1 (Generalized solution). (M, V,w,u) is said to be a gener- 
alized solution of the modified Mullins-Sekerka problem, if 

M C R x (0,00), w € L7,,(0, 00; H*?(2)),u € L?,.(0, 00; H?(2)") 

V is a Radon measure on N x P”! x (0,00). 

Moreover xm € C°([0, 00); L'(Q)) N L® (0, 00; BV (2)) and 

V* is a varifold on Q for allt > 0 
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such that for all T > 0, for almost every 0 < T < t < T and for all test 
functions Ç € C(O x [0,T)), Y € CE(Q, R”) and X € L2(0,T; H!?(2,R”)) 
the following holds: 


1. fo fol -2xm êt + Vw] = fo 2xmo.,0), 

2. 2 fo xm: div(wY) = (3 Vt, Y) + pee Jo (Wila — (Vu)? St) : DY, 
3. dV* (x, P) = X; pi(2) 6,4 (e) (P)du' (a) dP, 

4. du'(x) 2 20|Dxm: |(x)dz, 

5. Er Sg Wk + Ji Jovo? <u7(2)4+ dy So WE, 

6. JE fo S: DX deat = 0 


where p; € [0,1], 30,05 = 1,5; ri Qvi = Id and p’ is a Radon measure on 
Q. An upper index {.}* denotes the time. 


Remark 5.2. Let us discuss the definition in more detail. The first equation is 
the weak formulation of the diffusion equations (2.11) and (2.12). In the bulk 
the chemical potential will be harmonic. Equation 2 is the Gibbs-Thomson 
law (2.13) in a weak formulation (cf. Remark 4.3 and [Gar00]). Equations 3 
and 4 describe properties of the varifold. Inequality 4 allows that the varifold 
can possibly see phantom interfaces. Equation 5 states the dissipation of the 
free energy and equation 6 states in a weak form that the stress is divergence 
free in the bulk, cf. (2.14), and at the same time one obtains that the normal 
jump of the stress is zero across the interface. 


One should notice that the Gibbs-Thomson law has two terms which rep- 
resent the interface and vanish in the bulk, but the elastic term stays a volume 
integral. The reason for this is that the elastic energy is a non-local volume 
energy. So, one has to be aware in the limiting process that both EV]? and 
(cE) converge to a (n—1)-dimensional measure while W stays n-dimensional. 


Remark 5.3. In the case of Cahn-Hilliard systems, i.e. without any elastic 
terms, equation 2 in Definition 5.1 becomes of the same form as in the Theorem 
3.8 by Schätzle. This means that one can deduce rectifiability of the varifold 
in the case without elasticity, at least for the case p = 2, n = 3. 


Theorem 5.4 (Main). Let the assumptions mentioned in Sect. 3.2 hold. 
Then there is a sequence c; — 0 and a generalized solution (M,V,w,u) as 
in Definition 5.1 such that for all T > 0 


1. & —> —1 + 2xm in CY/9((0, T]; L?(Q)) and almost everywhere, 
2. wē > w weakly in L? (0, T; H'?(2)), 
3. uč > u in L?(0, T; H?(0)”). 
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More precisely the varifold is obtained in the following way. 
Proposition 5.5. For the sequence of Theorem 5.4 it further holds: 
1. There exist Radon measures H, Hki on Q x [0, œ) such that 
e“ (c )dadt > du(x, t), 
cicci Ci dadt > dugi(a, t) (5.2) 


Tk Tı 


both as Radon measures on Q x [0,T] for all T > 0. 
2. For all Y € C§(Q x [0,T],R”) it holds: 


T T 
J (OV",Y) =| [yy : [du(a, t)Id — (dpi; (a, t))aj]. 


Remark 5.6. The first part of the proposition follows easily by the energy 
estimates and using compactness properties of Radon measures. So it is left 
to show that the measures u and uij can be indeed identified as a varifold. 
This is essentially done by proving Theorem 5.7. 


We define for £ > 0 the set 
Ke := {(c,v) € H??(Q) x LQ) | —eAc+ etw (c) = v in R and 
ð c = 0 on ON}. 


Theorem 5.7. There exist a constant no € (0,1] and continuous and non- 
increasing functions Mı(n), M2(n): (0, no] — (0,co) such that for every ņ € 
(0, mo], every £ € (0, Mı (n)7}] and every (c,v) € Ke it holds 


[ED <n f eo temm) f o. (5.3) 


Q 


Remark 5.8. In the application of Theorem 5.7, v will be the sum 


v = wf —-W(c,E(u’)). 


5.2 Convergence of concentration and chemical potential 


One crucial a priori estimate is due to the H~! gradient flow property of 
the Cahn-Larché system with respect to the energy functional (2.5) which we 
already mentioned in Sect. 2.2. For more details see [Gar03]. 


Lemma 5.9. For alle >00 and0 <7 <t it holds 


t 
TOR fve < EȘ;(T). 


From equations (3.3) and (3.4) one easily gets the following a priori esti- 
mates: 
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Lemma 5.10. For alle > 0 and almost allt > 0 it holds 


1. ma Je c(.,t) = mo, 
2. folel* < C+ Eo), 
3. foll- 1)? < CeEp. 


Remark 5.11. The first equation describes one feature of the phase field model: 
conservation of mass over time. This is essentially due to the diffusion which 
is driven by a potential and the Neumann boundary conditions. 


We introduce the auxiliary function 


Paes a ~ \/Ū(s)/2ds, 


which is also known as the Modica ansatz. Here Ŭ (s) := min(W(s), 1 + |s|?) is 
used, so one has 


Cilsi — 82|? < |@(s1) — č(s2)| < C2|s1 — s2|(1 + |s1| + |s21) 
for some C1,C2 > 0. Using this auxiliary function it is possible to obtain 
bounds in BV (2). 
Lemma 5.12. For solutions to the Cahn-Larché system the Modica ansatz 
leads to 
IE || £2 (0,00;222(.2)) + IE lloegr + Ile llo1/8({0,00);£2()) <C (5-4) 


With these uniform bounds one can pass to the limit € — 0 along a 
sequence and together with Lemma 5.10 identify a set M C 2 x [0, co) such 
that we have the following lemma: 


Lemma 5.13. For solutions of the Cahn-Larché system there exists a se- 
quence cj — 0 such that 


e (x,t) > 20xm in C¥/9((0, T]; L1(2)), 
© (x,t) > -1+2xm in C1/9((0,T]; L°(Q)) and almost everywhere 


for all T > 0. 


This set M then defines (2_(t) for all t > 0. 

This proves the first convergence statement of the main theorem. For the 
chemical potential we observe that the following Poincaré type inequality 
holds: 


Lemma 5.14. For the solutions of the Cahn-Larché system we obtain 


lwt (s )Ilan2(a) < CEE + IVW tlla). (5.5) 
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To prove this lemma we test equation (3.4) with X-Vc* for X € Cj (2, R”) 
to get 


[uxve = fae V(XVE) + iW (E)X- VE + We(c%, E(u) ) XV 





= f: (VE: DXVE — $ div X |VE|?) + (4V +W oX- VE. 
Now we see that via partial integration 
fox : (WId) = pawxe = -fx VEW, (5.6) 
fox : (WId) = pavxw =— [x VEW e +XiW epiku] (5.7) 
= -fx VEW o — (OkXi)W ep1 0107, (5.8) 
where we used equation (3.5). With W e,ı = Sx: we obtain 


[ aivwexXer = fox : [eE (E)Id — VE @ VE + WId — (Vus) 9] . 


(5.9) 
We now introduce the mean value of w* as w* and use integration by parts 
to obtain 


[wxve =- f Wut xe- | (t-a) div xa | c div X. (5.10) 
R Q R Q 


Combining equation (5.9) and (5.10), one arrives at 


1 
we = [pea [Px : KE (E) + W(e%, u®))Id -EVE @ Vee — (Vus) sS] 
— Vu* -XE — (wf — w/c div X dz, 


where we choose a smooth X such that f oc divX # 0. Using elliptic regu- 
larity theory, the lemma can be verified. 

With this bound we conclude to weak convergence of the chemical poten- 
tial. 


Corollary 5.15. There exist constants C,eé9 > 0 such that for all € € (0, £o] 
and all T > 0 it holds 
T+1 
| lwt Hll) < C. (5.11) 
T 


Therefore, for a sequence £j — 0 there exists a function w € L*(0,T; H+?(2)) 
such that 
wi — w weakly in L?(0,T; H™?(2)). 
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5.3 Convergence of deformation 


Using the monotonicity of W g£, see (2.3), we obtain that the elastic energy 
density fulfills 
W(c,€) > ColE? — Cr(lel? + 1) 


for some constants Co, Ci > 0. Therefore we have for solutions (cf, u£) 


[rewrarse (is f we ears f Pa). 


Since the W-term is bounded by the total energy Ef, and the c*-term by 
the a priori estimate in Lemma 5.10, we have that ||E(u*)||p2(q) is bounded 
uniformly in t and £. By Korn’s inequality we can also control the deformation 
vector uf in H?(Q)”. 

Since L?(0, T; H':?(Q)") is a reflexive space, we have weak compactness of 
the deformation vector, i.e. for all sequences (£; )jen there exists a subsequence 
(€j,)ken Such that 


ue + u weakly in L?,.(0,00; H'?(2)”) 
for some u € L? (0,00; H4?(2)"). 


loc 
Now we use again the monotonicity of W e to get 


ei\|E(ue — u)|lZ2¢2x0,7)) 


< dees (W,e(c®, E(u’)) = W.e(c, E(u))) . E(ue = u) 


z -f WeC E(u) : E(u® — u). (5.12) 
Qx (0,T) 


The last equality is due to the divergence free stress tensor, cf. (2.4). One 
should notice that only W.¢(c*,E(u‘)), but not W.e(c®,E(u)) is divergence 
free, since only in the former term the respective deformation function uf 
meets the condition (2.4). 

By the strong convergence of the concentration function and the weak 
convergence of the deformation field, the right hand side of equation (5.12) 
goes to zero, i.e. we obtain strong convergence of the strain tensor for the 
sequence (€;,)xen. By Korn’s inequality we have that the deformation vector 
converges strongly in L?,.(0,00; H+?(9)”). Then for almost all t we have that 
Vu‘s« (t) converges strongly to Vu(t) in L?(). 

This verifies the third convergence statement of the main theorem. So far, 
we have shown the convergences as stated in the theorem, but we still have 
to verify, if the limit functions do represent a generalized solution according 
to Definition 5.1. Indeed the first diffusion equation immediately follows from 
equation (3.3) and the convergences of the concentration function and poten- 
tial. The identity 6 in Definition 5.1 follows from (3.5) in the limit € — 0, as 
Vu* and cê converge strongly. The other conditions require the specification 
of the varifold. 
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5.4 The limit varifold and the Gibbs-Thomson law 


This part deals with the limit varifold V. It is mainly derived from the con- 
vergence mentioned in Proposition 5.5 and we show that using Theorem 5.7 
we verify the remaining conditions of Definition 5.1. 

The energy density ef (£) := §|Vc*|?+4W(c*) and eVc* @Vc* are bounded 
by the initial energy: 


T 
i [ete Paa | ppit) dt < TE0 
4 fE (Ver) (Ver);| da dt < L [ee €) dz dt < TEo. 


By compactness there exist Radon measures u and uij according to (5.1) and 
(5.2). But since we also have energy estimates for all times t, we can split 
the measures du(x,t) and duij(x,t) into a spatial and time component, both 
being still Radon measures: 


du(x,t) = dp (a)dt, — dprag (a, t) = dui; (a) dt. 


The energy estimates in Lemma 5.9 show that the energies of the phase 
field solutions are non-increasing. This feature carries through the limit € 
going to zero: 


Lemma 5.16. For a sequence €k — 0 there exists a non-increasing function 
E: [0, œ) — [0,00) such that for all t > 0 


EE: (t) > E(t). 


One has to verify that this function E is indeed the energy of the sharp 
interface model. As mentioned above the interfacial energy converges to a 
Radon measure u. Together with the strong convergence of the deformation 
vector u, the function E can be identified as the energy of the limiting system: 


E(t) +f Poa WilE 


This shows that part 5 of Definition 5.1 is fulfilled in the limit € — 0. 
Equation (5.9) gives in the limit € — 0 


foxo- div(wX) = fox : [duld — (dpiz)ij] + [x : (Wid — (Vu)? S). 
Remark 5.17. The claim is now that f DX : [duld — (dpi; )ij] can be seen as 


the first variation of a varifold. This will prove Proposition 5.5. Hence, part 2 
of Definition 5.1 will be verified. 
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Proof (Proposition 5.5). For Y, Z € C°(Q x [0,7], R”) one gets 


L [xP avetave nas f f ize) +f J zee. 


This means that in the limit € — 0, the last integral is non-positive and 
one gets the inequality 


T T 
| J YT- naz f J iviizian 
0 R 0 Q 


which means that the measures pij are absolutely continuous with respect 
to u. Then there exist -measurable functions vi; such that duij(x,t) = 
vij(x,t)du(x,t) and we get 


T 


Since the matrix (14;);; inherits the symmetry from (5.2), the matrix is pos- 
itive semi-definite and by (5.13) it further holds 0 < (vij); < Id. This 
means one can write this matrix as (Vj;)i; = >o;_) Pivi Q Vi where p; € 
[0, 1], X; vi@v; = Id. Moreover 9°; ñi < 1, since actually for y € C°(2x 0, T]) 


ah [vere vet evel) (VE @ Ve") -f f er (Er) 4 E (ce) 


=|Vc£k |2 


and liMmk—>oo Er tr(Vc® Q Ve") = JX; (via)dp. Recall that the trace of a matrix 
is the sum of its eigenvalues. 


Setting p; := pi + (1 i 5i) € [0, 1] we get 
Id — (vij )ij =Id- ni gv; =J pi (ld- v: 8 vi). 


Thus we can see the limiting varifold as 


dV (a, P) =e pila dv, (P) 
where 6,, is the projection onto the hyperplane normal to 1;. 


5.5 Control of discrepancy measure 


In the case of homogeneous elasticity we have 
|W.-(c,E(u))| < CA + lel + |E) 


which leads W..(c°, €(u®)) to be in L?(Q) for almost all times t > 0. So we 
can follow the proof of Chen in [Che96] for the estimation of the discrepancy 
measure. 

The proof is based on a blow-up technique for which we need some prepara- 
tory lemmas. 
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Lemma 5.18. For each n > 0 there is a constant R(n) > 2 such that for all 
R > R(n) the following holds: 
If i 
N = {(x', £n) € Br | tn > Y (x')} 


is a domain in R”, Y : R"-! — R satisfying 


Y(0') <0, Var¥(0'!)=0', ID}Y llos) < R (5.14) 
and if (c, v) € H?(Q) x L?(Q) with 
—Ac+W'(c)=v in Q, (5.15) 
Ze =0 on {(x', £n) € Br | £n = Y (x")}, (5.16) 
lvllzz(8rn) < E (5.17) 


then the following inequality holds 
J (\Ve|? — 20(c))* < nf (\Vel? + v(e)? + W(c) +v?) 
Bin Bon 


+f . |Vel?. (5.18) 
{ee BiN2||¢e|>1—n} 
Proof. For the proof, which we roughly sketch, one studies the interfacial 
region: 7 7 

2 := {x E By ND | |e] < 1-7}. 


For the case that |{2;| is sufficiently small, one gets by Hélder inequality 
Vell p24.) < 21)" Vell) s Cyl|Vell p1.2(8,n0) 


where m* = 3— 22 with 2* = an for n > 2 and 2* = 7 otherwise. One can 
notice that one 7 appears on the right hand side, which finally leads to the 
statement. For the other case |2,| being large one can use a contradiction 
argument. Through this assumption the homogeneous equation Ac = Y’ (c) 
is recovered. Here one can use elliptic regularity theory to get smoothness of 
the function c. Comparison with viscosity functions yields that c would be 
in fact bounded in [—1,1]. Results by Modica [Mod85] then finally finish the 


proof. 





Now we need a control on the bulk energy of the interface. This is shown 
in the following lemma. 


Lemma 5.19. There exist positive constants Co and no such that for every 
n € (0, no], every £ € (0,1) and every (c,v) € Ke the following holds 


J (e° (c) + +e W'(c Je) 
{xEQ||e|21—n} 


< Con e|Vel? + Coe f v?. (5.19) 
{w€Q||c|<1—n} 2 
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The proof of this lemma is based on the convexity property of Y” for values 
\c| > 1 — n. One combines both 


[v= [ Od 
NQ NQ 


from the equation (3.4) and the Young inequality 


[ws | G+ ee) 


where y% = W’ except in the bulk, so that one has bounds for y” in {|c| > 1—n}. 


Proof (Theorem 5.7). We give a simple sketch of the proof for Theorem 5.7. As 
already mentioned above we use a blow-up technique. The set (2 is covered by 
balls Bre(x;) where R is as in Lemma 5.18. Changing variables to y > xj +ey 
and rescaling vi (y) = evf (x; + ey), one gets the equation 


—A,ci + W'(c!) = v. 


By this blow-up process the right-hand side v can be decreased so much that 
Lemma 5.18 is applicable. Using Lemma 5.19 this ends the proof. If v does 
not fulfill the assumptions of Lemma, 5.18, other elliptic estimates can be 
used. The careful choice of the covering then ensures that by assembling the 
covering the desired estimate is attained. 
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Summary. In this article, we review the progress on phase-field modelling, sharp 
interface asymptotics, numerical simulations and applications to microstructure evo- 
lution and pattern formation in materials science. Model formulations and computa- 
tions of pure substances and of binary alloys are discussed. Furthermore, a thermo- 
dynamically consistent class of non-isothermal phase-field models for crystal growth 
and solidification in complex alloy systems is presented. Explicit expressions for the 
different energy density contributions are proposed. Multicomponent diffusion in the 
bulk phases including interdiffusion coefficients as well as diffusion in the interfacial 
regions are discussed. Anisotropy of both, the surface energies and the kinetic coeffi- 
cients is incorporated in the model formulation. A 3D parallel simulator based on a 
finite difference discretization is introduced illustrating the capability of the model 
to simultaneously describe the diffusion processes of multiple components, the phase 
transitions between multiple phases and the development of the temperature field. 
The numerical solving method contains parallelization and adaptive strategies for 
optimization of memory usage and computing time. Applying the computational 
methods, we show a variety of simulated microstructure formations in complex mul- 
ticomponent alloy systems occuring on different time and length scales. In particular, 
we present 2D and 3D simulation results of dendritic and eutectic solidification in 
pure substances and binary and ternary alloys. Another field of application is the 
modelling of competing polycrystalline grain structure formation and grain growth. 


1 Introduction 


Materials science plays a tremendous role in modern engineering and technol- 
ogy, since it is the basis of the entire microelectronics and foundry industry, 
as well as many other industries. The manufacture of almost every man-made 
object and material involves phase transformations and solidification at some 
stage. Metallic alloys are the most widely-used group of materials in indus- 
trial applications. During the manufacture of castings, solidification of metallic 
melts occurs involving many different phases and hence, various kinds of phase 
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Fig. 1.1. Experimental micrographs of Al — Si alloy samples: a) Grain structure 
with different crystal orientations, b) network of primary Al dendrites and c) in- 
terdendritic eutectic microstructure of two distinguished solid phases in the regions 
between the primary phase dendrites. (See page 683 for a colored version of the 
figure.) 


transitions [KF92]. The solidification is accompanied by a variety of differ- 
ent pattern formations and complex microstructure evolutions. Depending on 
the process conditions and on the material parameters, different growth mor- 
phologies can be observed, significantly determining the material properties 
and the quality of the castings. For improving the properties of materials in 
industrial production, the detailed understanding of the dynamical evolution 
of grain and phase boundaries is of great importance. Since numerical simu- 
lations provide valuable information of the microstructure formation and give 
access for predicting characteristics of the morphologies, it is a key to under- 
standing and controlling the processes and to sustaining continuous progress 
in the field of optimizing and developing materials. 

The solidification process involves growth phenomena on different length 
and time scales. For theoretical investigations of microstructure formation it 
is essential to take these multiscale effects as well as their interaction into 
consideration. The experimental photographs in Fig. 1.1 give an illustration 
of the complex network of different length scales that exist in solidification 
microstructures of alloys. 

The first image (Fig. 1.1a)) shows a polycrystalline Al-Si grain structure 
after an electrolytical etching preparation. The grain structure contains grain 
boundary triple junctions which themselves shown an individual physical be- 
haviour. The coarsening by grain boundary motion takes place on a long time 
scale. If the magnification is enlarged, a dendritic substructure in the interior 
of each grain can be resolved (Fig. 1.1b)). Each orientational variant of the 
polycrystalline structure consists of a dendritic array in which all dendrites of 
a specific grain have the same crystallographic orientation. The third image 
in Fig. 1.1c) displays the interdendritic eutectic structure at a higher reso- 
lution, where eutectic lamellae solidify between the primary dendritic phase. 
In a eutectic solidification of this kind, two distinct solid phases Sı and S2 
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grow into an undercooled melt if the temperature is below the critical eutectic 
temperature. Within the interdendritic eutectic lamellae, a phase boundary 
triple junction of the two solid phases and the liquid occurs. The dendrites 
and the eutectic lamellae grow into the melt on a micrometer scale and during 
a short time period. Once the dendrites and the eutectic lamellae impinge one 
another, grain boundaries are formed. 

Alloy systems with multiple components are an important class of materi- 
als, in particular for technical applications and processes. The microstructure 
formation of a material plays a central role for a broad range of mechanical 
properties and, hence, for the quality and the durability of the material. Aim- 
ing for a continuous optimization of materials properties, the study of pattern 
formation during alloy solidification has been a focus of many experimental 
and, recently, also of computational work. Since the microstructure character- 
istics are a result of the process conditions used during production, the analysis 
of the fundamental correlation between the processing pathway and the mi- 
crostructure is of fundamental importance. Multiple components in alloys are 
combined with the appearance of multiple phases leading to complex phase 
diagrams, various phase transformations and different types of solidification. 
Modelling and numerical simulations aim to predict microstructure evolution 
in multicomponent alloys in order to virtually design materials. However, the 
great number of material parameters and of physical variables involved in sys- 
tems yields a complexity that remains a big challenge for future work. In par- 
ticular, the gain of statistically meaningful data from computations requires 
simulations in sufficiently large domains with a tremendous need of memory 
and computing time resources. To treat complex systems, high performance 
computing, parallelization and optimized algorithms including adaptive mesh 
generators are mandatory. 

The phase-field method has become an important tool for tackling free 
boundary problems such as grain boundary motion [Sea96, GNS99aJ, and for 
simulating crystal growth, solidification and pattern formation phenomena 
in alloys [WMB96, TN*98, BS98, KWC00, PK00, NW00, JGD01, SPGO1, 
BW*02, GBP02, AB*02, KO*03, RB*04]. The advantage of the phase-field 
method lies in the formulation of diffuse interfaces of a finite thickness. Explicit 
front tracking is avoided by using smooth continuous variables locating the 
grain and phase boundaries. By asymptotic expansions for vanishing interface 
thickness, it can be shown that classical sharp interface models including phys- 
ical laws at interfaces and multiple junctions are recovered [Cag89, GNS98]. 
Since phase-field models can be derived on the basis of classical irreversible 
thermodynamics [PF90, WS*93, GNS04], they can be applied to processes 
close to thermodynamical equilibrium, i.e. at relatively small driving forces. 
Extensions of the phase-field approach to describe strongly nonequilibrium 
solidification, solute trapping and solute drag effects at large driving forces 
are discussed in [AW*98, Gal01]. The scaling problem of quantitatively mod- 
elling the low growth rate regime where the microstructure is typically several 
orders of magnitude larger than the microscopic capillary length has been 
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overcome by a so-called thin interface approximation of the phase-field model 
[KR96, A199, KKS99, MWA00, Kav01]. 

The purpose of this article is to extend the advances of the phase-field ap- 
proach to model general multiphase solidification in situations close to local 
thermodynamical equilibrium. The underlying formulation of an entropy den- 
sity functional is given in Sect. 2 and explicit expressions for the free energy 
densities of the bulk phases and of the interfaces are discussed. A method is 
described how the artifical appearance of a foreign phase contributions at a 
two-phase boundary can be avoided. Formulations defining the bulk, interdif- 
fusion and interfacial diffusion coefficients as well as different types of surface 
energies and kinetic anisotropies are presented. In particular, an expression 
of crystalline (facetted) anisotropy is given that can be used for modelling 
general crystal shapes with an arbitrary number of edges and corners in three 
spatial dimensions. The essential ingredients of the phase-field model are sum- 
marized, the numerical method for solving the governing equations is briefly 
explained in Sect. 3. Examples of possible applications to numerically simulate 
dendritic growth, eutectic solidification in binary and ternary alloys, moving 
grain and phase boundaries and the formation of polycrystalline grain struc- 
tures are given in Sect. 4. The simulation results are intended to illustratively 
show the potential of the phase-field model in computing and numerically 
analyzing complex pattern formation. 


2 Phase-field Model for Multiphase and Polycrystalline 
Systems 


The phase-field model allowing for an arbitrary number of phases (or grains) 
and components is derived from an entropy density functional in a thermody- 
namically consistent way, [GNS04, NGS05]. The formulation is defined solely 
via the bulk free energies of the individual phases, the surface energy den- 
sities (surface entropy densities, respectively) of the interfaces, the diffusion 
and mobility coefficients. Thus, the full set of phase-field evolution equations 
is defined by quantities which can be measured. Since the bulk free energies 
determine the phase diagrams (see, e.g., Chalmers [Cha77], Haasen [Haa94]), 
the phase-field model can be used to describe phase transitions, in principal, 
for arbitrary phase diagrams. For a number of binary and ternary alloys, the 
bulk free energies are provided by thermodynamical data base programmes 
and can be directly used in simulation applications. 

The phase-field model for a general class of multicomponent and multi- 
phase (or polycrystalline ) alloy systems is formulated consisting of K com- 
ponents and N different phases (or grains) in a domain 2 C IR. The domain 
NQ is separated in phase regions (2),..., mn occupied by the N phases as 
schematically illustrated in the left image of Fig. 2.1 for the situation N = 4. 
The middle and right images show examples of an Al-Si grain structure with 
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Schematical Drawing Grain Structure Multiphase Solidification 


Fig. 2.1. Left image: Schematic drawing of a domain separation by four different 
phase regions; Middle image: Polycrystalline grain structure in Al-Si; Right image: 
Multiphase solification microstructure with dendrites and a eutectic structure. (See 
page 683 for a colored version of the figure.) 


grains of different crystallographic orientations and of a real multiphase struc- 
ture with primary dendrites and an interdendritic eutectic substructure. 

The concentrations of the components are represented by a vector c(x,t) = 
(ci(x,t),...,¢K(x,t)). Similarly, the phase fractions are described by a vector- 
valued order parameter (x,t) = (¢1(x, t),..., dw (X, t)). The variable a(x, t) 
denotes the local fraction of phase of grain a. The phase-field model is based 
on an entropy functional of the form 


Secgo) = | 


Q 


(se E w(@))) ae 


We assume that the bulk entropy density s depends on the internal energy 
density e, the concentrations c and the phase-field variable @. We require that 
the concentrations of the components and of the phase-field variables fulfill 


the constraints 
K N 
Seah, X pasl. (2.1) 
i=1 a=1 


2.1 Entropy density contributions 


It will be convenient to use the free energy f(T,c,@) as a thermodynamical 
potential. We therefore postulate the Gibbs relation 


K N 
df =-sdT + X` wider + X` radha. 
a=". a=1 


Here, T is the temperature, 4; = fic, are the chemical potentials, and rg = 
f,ġa are potentials due to the appearance of different phases. We set 


e=f+sT, 


and hence 
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de = Tds + >", wade; + $ a radĝa, 
ds = 4de — Ð, Bde; — 1, doa. 
If we interpret s as a function of (e, c, œ), then we have 


— 1 PE S a 0 
‚Ci T’ Pa T` 








We note that given the free energy densities of the pure phases fa(T, c), we 
obtain the total free energy f(T,c,@) as a suitable interpolation of the free 
energies fa of the individual phases in the system. By inserting the free energy 
f into the phase-field method enables to model systems with a very general 
class of phase diagrams. In the way it is formulated, the model can describe 
systems with concave entropies sq(e,c) in the pure phases. This corresponds 
to free energies fa(T,c) which are concave in T and convex in c. In the case 
where f(T, c) is not convex in the variable c, the free energy needs to contain 
gradients of the concentrations (as in the Cahn-Hilliard model). 

Choosing the liquid phase to be the last component ¢y of the phase-field 
vector @, an ideal solution formulation of the bulk free energy density reads 





K 
FAP en= > > (ake ie T = h(a)) +5 (427; In(c)) — cT n(x), 
a=1i=1 i=l 

with LN = 0 and LẸ, i = 1,...,K, a=1,...,N — 1, being the latent heat 
per unit volume of the phase transition from phase a to the liquid phase and 
of pure component i. Furthermore, T7, i = 1,...,K, a= 1,...,N — 1 is 
the melting temperature of the i-th component in phase a, Tm is a reference 
temperature. cy, the specific heat and Vm, the molar volume are assumed to be 
constant, Rg is the gas constant. With a suitable choice of the function h(¢) 
satisfying h(0) = 0 and h(1) = 1, e.g. h(¢a) = $a or h(ba) = $2(3—2¢,), the 
free energy density f is an interpolation of the individual free energy densities 
fa. We can calculate the entropy density 


TE cee TO (ekin a) - $ (Heated) eon) 


a=1 i=1 i=1 
so that 
N K 
See (ec, LE A(gba)) + eT. 
a=1i=1 
We note that if L7 = L“ for all components 7, then e does not depend on c. 


The chemical potentials w;(T, c, @) = RT. c, @) are given by 





N 
yi(T,¢,6) =Y (aa te Moa) ) $ Z Tine) +1. 22 


a=1 
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A more general expression for the bulk free energy density of alloys is the 
Redlich-Kister-Muggianu model of subregular solution 


K K M 
fer = fia + 5 5 CiCj D MY? (ci — 5)", 
v=0 


i=1 j=1 


with binary interaction coefficients uy? depending on the parameter v. For 
M =0, the Redlich-Kister-Muggianu ansatz takes the form of a regular solu- 
tion model. In most applications, in particular to metallic systems, M takes 
a maximum value of two. A ternary term ~ cjcjcp can be added to describe 
the excess free enthalpy. 

The thermodynamics of the interfaces gives additional contributions to the 
entropy given by a Ginzburg-Landau functional of the form 


z. (colo, v0) + tuo) da. 


Here, a(@, V¢@) is the gradient entropy density which is assumed to be homoge- 
neous of degree two in the second variable; i.e., a(¢, nV) = 77a(%, Vo), Yn € 
IR*. The simplest form of the gradient entropy density is 


N 
a(¢, Vo) = X. |Veal?. 
a=1 
However, it has been shown [GNS98, GNS99b] that gradient entropies of the 
form 


a(, V) = X Aas(aVbs — 63V ba); (2.3) 


a<ß 


where Aag are convex, homogeneous degree two functions, are more conve- 
nient with respect to the calibration of parameters in the phase-field model 
to the surface terms in the sharp interface model. 

In liquid-solid and solid-solid phase transitions the anisotropy has a large 
effect on the pattern formation. A choice of the gradient entropies that leads 
to anisotropic surface terms is 


al$, V) = >> Yag acldap)? \daal?, (2.4) 


a<ß 


where Yag represent surface entropy densities and qag = (¢aV¢s—¢8V ¢q) are 
generalized gradient vectors. The formulation using the generalized gradient 
vectors qag allows to distinguish the physics of each phase (or grain) boundary 
by providing enough degrees of freedom. Anisotropy of the surface entropy 
density is modeled by the factor ac(qag)? depending on the orientation of 
the interface. Isotropic phase boundaries are realized by a¢(dag) = 1. Weakly 
anisotropic crystals with an underlying cubic symmetry in 3D can be modeled 
by the expression 
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4 
da 
ac(das) = 1 — Sap (8-42), (2.5) 


with dag being the strength of the anisotropy of the a-8 interface. The 
2 
norms are given by |daalf = Deiat) and Jaagt = (SL1(@)) with 


qi = (baz bs a pea ba): 

The function in Eq. (2.5) for the cubic crystal symmetry is plotted in 
Fig. 2.2a) for an anisotropic strength ô = 0.15. The equilibrium shape of a 
crystal can be determined by finding the inner envelope of the planes perpen- 
dicular to all radius vectors from the center to the surface depicted in Fig. 2.2 
(Wulff-construction). In the directions of the depressions, curved faces evolve 
during growth. 

For a strongly anisotropic crystal of facetted type, we define 


daß r k } 2 
ax Apa Ee) (2.6) 





Ae (dag) oo 


where ng, k =1,...,Nag are the nag corners of the Wulff shape of the a-8 
transition leading to flat crystal faces with sharp edges. These evolve in the 
direction of the cusps. In principal, Eq. (2.6) allows to model arbitrary crystal 
shapes with nag corners. For a comparison with the expression for smooth 
anisotropy in Eq. (2.5), we display in Fig. 2.2b) the function a¢(qag) in Eq. 
(2.6) also for a cubic crystal symmetry. In Fig. c) the simulation results of two 
crystals with 45° rotated orientation growing from adjacent nuclei are shown. 
Each grain develops its minimum energy surfaces in contact with the melt 
and at their interface. 

The interfacial entropy density contribution w(@) is a nonconvex function 
with N global minima corresponding to the N phases or grains in the sys- 
tem. As an extension of the standard double-well potential, one may take the 
standard multi-well potential 


ay (fot SK, b) an oe a c) ; 





Fig. 2.2. 3D surface plot of a) a smooth and b) a facetted cubic anisotropy. c) 
contour plots of two adjacent growing, 45° misoriented cubic crystals applying the 
smooth anisotropy formulation in Eq. (2.5) with 6 = 0.2. (See page 684 for a colored 
version of the figure.) 
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Fig. 2.3. Plot of the multi-well potential ws:(@) for N = 3 and equal surface entropy 
densities yag. (See page 684 for a colored version of the figure.) 


wal) =9 >) Yas $ads (2.7) 
a<ß 
or a higher order variant 
Dst(b) = wal) + X 709562,0505- (2.8) 
a<B<d 


For practical computations the multi-obstacle potential yields good cali- 
bration properties. It is defined by 


Wov() = 5 X Yap Pas, (2.9) 


a<ß 


with a higher order variant 


Wov(P) = wol E) + 5 Yapi pape ps, (2.10) 


a<B<d 


where wo and Ùo are defined to be infinity whenever @ is not on the Gibbs 
simplex. In Figs. 2.3 and 2.4, we show a plot of both expressions, the multi-well 
and the multi-obstacle potential for the case of three phases (N = 3). 

We refer to [GNS99a] and [GNS99b] for a further discussion of the proper- 
ties of the surface terms Wst, Üst, Wop and Woy. We assume for simplicity that 
a(@, V@) and w(@) and, hence, the interfacial contributions to the entropy, 
do not depend on (T,c). 
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Fig. 2.4. Plot of the multi-obstacle potential wos(@) for N = 3 and equal surface 
entropy densities Yag. (See page 684 for a colored version of the figure.) 


2.2 Evolution equations 


The energy and mass balance equations can be derived from the energy flux 


Jo and from the fluxes of the components J1, ..., Jg by 

o 

S =-V-Jo energy balance, (2.11) 

Oc; 

ae ~ -V- Ji mass balances (2.12) 
and are coupled to a set of phase-field equations 

Oba 6S ; 
reba = See —xX phase-field equations, (2.13) 


in such a way that the second law of thermodynamics is fulfilled in an appro- 
priate local version. 

In order to derive the expressions for the fluxes Jo,..., JK, we use the 
generalized thermodynamic potentials $$ = + and 22 = (=), which will 
drive the transition. Now, we appeal to nonequilibrium thermodynamics and 
postulate that the fluxes are linear functions of the thermodynamic driving 
forces vi vi, i Vee to obtain 


Jo = Loo(T, C, pv + D Lot, C, pV 
= Lo(T, c, 8) VF + X3- Loj(T, c, 6) VF, 





Ji = Lio(T, C, ov a ee Li, (T, C, o)V 22 


0c; 


= Lio(T, 6, $) Vp + Dja Lig (T, c, p) V, 
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with mobility coefficients (L;;);,;=0,....«- To fulfill the constraint yy Gg =l 
in Eq. (2.1) during the evolution, we assume 


K 
dH = j=0,...,K, 


which implies 7, J; = 0, and, hence, 0,(37, c) = V- (D Ji) = 0. We 
further assume that L is symmetric (Oraa a relations) and positive semidef- 
inite; i.e., 


K 

D yik 0 VE = g ie) ERE. 

i,j=0 
This condition ensures that an entropy inequality is satisfied. Cross effects 
between mass and energy diffusion are included in the model. One can neglect 
them by setting Lig = 0 and Lo; = 0 for all i, j € {1,..., K}. In general, the 
mobility coefficients (L;;);,;=0,,...,« are allowed to depend on T, c and ¢. Given 
some heat and mass diffusion coefficients, k = k(T, c, @) and D; = D;(T,c, >), 
the mobility coefficients Lij read 


v D;c; 
j j R, ( j FS | ( ) 
for i,j =1,...,K and then recursively define 
Loj = as T Ljih( a)L (2.16) 
a=1i=1 

N,N K,K 
Loo = kT? E Em oe NO hla) LE Ljih( ba) LF (2.17) 

aB ij 


where ĝ;; denotes the Kronecker delta and L® are the latent heats of fusion. 

The formulation in Eqs. (2.15)-(2.17) takes bulk diffusion effects includ- 
ing interdiffusion coefficients into account. The dependence of the mass and 
heat diffusion coefficients on @ can be realized by e.g. linear expansions. To 
also consider enhanced diffusion in the interfacial region of phase or grain 
boundaries, additional terms proportional to fag with interfacial diffusion 
coefficients De (T, C, dag) need to be added. Altogether, we suggest for mass 
and heat diffusion 


N 
Di = X` D? (T, c)ba + SD D$’ (T, ¢, qas)babe, (2.18) 
a=1 


eG 


i 
Mz 


k k°(T, eda, (2.19) 


R 
Il 
m 
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i.e. in particular that the diffusion coefficients in Eq. (2.15) and (2.17) can be 
anisotropic. 

For the nonconserved phase-field variables ġ1,..., mn, we assume that the 
evolution is such that the system locally tends to maximize entropy conserving 
concentration and energy at the same time. Therefore, we postulate 


Teðipa = €(V-4,064(b, V$)—4,4,($, VO) -2l Ate Py, 

(2.20) 

where we denote with a.4,, W,¢,., fd, and a Yoa the derivative with respect 
to the variables corresponding to ġa and Voa, respectively. 

For material systems with anisotropic kinetics, the kinetic coefficient T 
may depend on the generalized gradient vectors qag in a similar way as the 
gradient energies a(@, V@) in Eq. (2.5). The quantity r = r(@, Vo) in Eq. 
(2.13) models an anisotropic kinetic coefficient of the form 


7(, Vo) = T + X` Bas(das) 


a<ß 


with Bap(qag) = 0 if qag = 0. Possible choices are 








4 
Bap = 72g (1 + Cop (3 + alel) -Tm or (2.21) 
daß 
Bag = 723. max { Tap -EF } — To (2.22) 
aP 1<k<rag ld, ae , 


if dag # 0 for weakly cubic (Eq. (2.21)) or strongly facetted (Eq. (2.22)) 
kinetic anisotropies with rag corners Ek a. Cag determines the strength of the 
kinetic anisotropy similar to 6a in Eq. (2.5) for the surface energy anisotropy. 
Systems with isotropic kinetics are realized by setting Cag = 0. 

A is an appropriate Lagrange multiplier such that the constraint yr da = 
1 in Eq. (2.1) is satisfied; i.e., 


= ED [é(¥ erea($.09)—245(8:98)) -Ealo 


a=1 


Futa a 
T 


It has been shown in [GNS04] as a result that inequality 
1 
ay(entropy) = a (sle,e, Ø) ~ eal, V6) ~ wla) ) 


K, ai N 
>-V.- (> = Ji - > ose] 


i=0 a=1 





holds and therefor the derived phase-field equation ensures a positive local 
entropy production. 
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2.3 Non-dimensionalization 


A dimensionless form of the system of governing equations was necessary on 
account of computational efficiency and accuracy. To non-dimensionalize the 
system with respect to time, space, temperature and inner energy density, we 
assume 
x F = J pA 
t[s] = 7* - t, alm] = £- ž, T|K]=%-T, e |— | = (&0) -é, 
m 
where ¢,%,T,é are dimensionless and 7*[s], [m], 0[K], cv[J/m3 K] are refer- 
ence quantities. The notation “indicates dimensionless quantities. Introducing 


dimensionless latent heats L% and a rescaled gas constant R,, the chemical 
potential uj (Eq. (2.2)) follows as 


J 7 . al JIK] | ~ J _ 7 
Hj Fa =(cy,0)-f; with Lẹ | =(cV)-Le, R Fea =UmCy Ry. 





With the dimensionless mobility coefficients (Eqs. (2.15)-(2.17)) Loo, Loj, Lio, 
Lij of the form 














JK Let? ~ m? K 029 ~ 
00 =] = — Loo, Loj | | = — Loj, 
sm Te S 7 
2K 209 ~ 5K Ze s 
Lio 2 | a Lio, Lij ES = aa 
S pF Js Cots 


and with the dimensionless mass and heat diffusivities 


2 2 2A. 
pe |= | = Soe and ke | =| = Se. 








*I| 8 T msK T* 


the energy and mass diffusion equations (Eqs. (2.11) and (2.12)) can directly 
be used with dimensionless quantities. 

The phase-field equations with the surface contributions a(@,V@) (Eq. 
2.4) and w(@) (Eqs. (2.7) - (2.10)) are treated as follows: The surface entropy 
densities Yag, the kinetic coefficients 7 and the interface width £ are scaled as 


J N T 3 
Yap [az | = 0o r=m-# and elm) =é-6 


Inserting these quantities into the phase-field equation (Eq. (2.13)) (both sides 
with the dimension of an entropy density [J/m°K]) and division by c, gives 


FD dba = (Vg 4,(6,V0) — A9.(6.V0)) 


ei - 
-falat 5, a=1,...,N. 


a| rR 


Yo ~ 
fe, Pa ($) 
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Table 2.1. Data set for pure Ni with dimensional and nondimensionalized values 


Parameter Panelni dim. value none value 


melting temperature 
latent heat 
specific heat 


average surface tension 
thermal diffusivity 





When making the choice yo = fc, and T7 = T, the phase-field equation 
retains its original form (Eq. 2.13) with dimensionless quantities instead of 
the dimensional ones. Finally, if we consider the Gibbs-Thomson equation de- 
scribing the motion of a sharp interface with the curvature « (square brackets 
indicate a jump of the respective value) 


(=m a 2 UE 





| (2.23) 


it can easily be seen that the mobility coefficients Bag are equal to the kinetic 
coefficient 7 in the phase-field equation (Eq. (2.13)). 

The length scale parameter @ can be related to the size of the domain 
resolved with N, grid points. In the case of the pure Ni system, the size of 
a thermal dendrite is Nx - l- AZ = 2 . 1075m. With a feasible number of 
gridpoints N, = 500 and a dimensionless cell spacing AZ = 1.0, we have 
l = 4.0-10-§m. Using thermophysical data for pure Ni from [BK*02], we 
obtain the nondimensional data set listed in Table 2.1. 


3 Numerical Methods 


In the previous section, a general multi-component multi-phase-field model 
was formulated leading to a system of coupled partial differential equations 
(Eqs. (2.11) - (2.13)). In the general case, there will be one energy equation, K 
concentration equations and N phase-field equations to be solved numerically. 

Our numerical algorithm for solving the general system of equations is 
based on a finite difference scheme with explicit time update on a regular grid. 
Despite its disadvantages concerning stability and limited time step size, this 
straight forward approach is justified by the applicability of various sophisti- 
cated techniques to reduce the computational effort and to save memory. This 
becomes especially important when treating problems with a high number of 
phases N as in the case of grain growth. Further, the coupling of the numer- 
ical scheme with finite difference solvers for fluid flow or elasticity is greatly 
facilitated (the approach of coupling to a Navier-Stokes solver is specified in 
[SNW05]). In the following, the discretization of the balance equations (Eqs. 
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(2.11) and (2.12)) and of the phase-field equation (Eq. (2.13)) is considered 
in detail. 

The phase-field variable ġa defines the smallest length scale and the largest 
spatial gradients due to its rapid change from 0 to 1 over the width of the inter- 
face. When using a regular grid, it must be chosen fine enough to adequately 
resolve the diffuse interface layer of the phase-field variables. Depending on 
the kind of spatial discretization, it should extend over at least 5 to 10 grid 
points. The diffusion lengths for the temperature T and for the concentrations 
ci are much greater than the interface width, so that one could consider the 
application of lower grid resolutions for these conserved fields. We anticipate 
this option to keep a higher accuracy, since over the diffuse layer in general, 
both, the phase-field as well as the diffusion fields may change rapidly. 


3.1 FD discretization and staggered grid 


The complete set of evolution equations of the model (Eqs. (2.11) - (2.13)) can 
be treated in the following simplified divergence form, where uz, Um stands for 
the respective field quantity (l, m indicative for e, ci, da): 


Ou, 


= cum) (3.1) 


Um; Ou; 
a 


=rhs =V-£( 


All terms including field variables and their spatial derivatives are arguments 
of the vector valued function f. In case of the phase-field equation (Eq. (2.13)) 
the right hand side (rhs) contains additional source terms, which in general 
are small by a value compared to the divergence term, so that the general 
character of Eq. (3.1) is kept preserved. Concerning the time discretization, 
an explicit forward Euler scheme is applied with a time update according to 
ut? = u” + At- rhs” (time steps are superscripted). This explicit scheme 
requires a control of the temporal step width At for each individual equation. 
For the case of a 3D simulation and an identical grid step width Az in each 
space dimension, the criterion for stability would suggest a stepwidth of 


At < min Ae Ast Aa” 

= 6kmax 6Dmax’ 6228 
where kmaz and Dmag are the maximum values for all heat and mass diffu- 
sion coefficients, Yag is the maximum surface entropy coefficient among all 
appearing phases and To is the kinetic coefficient. 

Since the right hand sides of the conserved order parameter equations (Eqs. 
(2.11) and (2.13)) as well as the first term in Eq. (2.13) consist of a divergence 
term, a two-step algorithm was developed: First, the vector flux quantities are 
calculated using right-sided finite differences and they are stored in a memory 
buffer for multiple access. This buffer holds the flux values of three adjacent 
2D-layers during the layerwise calculation, shifted through the 3D grid along 
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the z-direction. In a second step, the divergence is evaluated using left-sided 
differences. This results in an extremely memory saving numerical scheme 
without any redundant calculations, since for each field variable only a single 
3D array must be stored in memory ([WN06]). The spatial discretization of 
the phase-field and of the balance equations differs and is described in the 
following paragraph. 


Balance equations: 


In the nonlinear energy and mass diffusion equations, the physical diffusion 
coefficients are incorporated in the Onsager coefficients Lij, which may depend 
on ¢, cand T. The discretization on a regular grid with spatial indices (i, 7, k) 
for the following simplified energy equation (mass diffusion cross terms have 
been omitted for clarity): 


Oe 1 


an -V-Jo=—-V- (Loot, C, #¥(z)) (3.2) 


is accomplished in the FTCS scheme (forward in time, centered in space). 
First, to compute the divergence on the right hand side of Eq. (3.2), all com- 
ponents of the energy flux vector are assembled. For a grid cell with indices 
(i, j, k) the spatial derivatives are approximated with right sided finite differ- 
ences, and the Onsager coefficients Loo are evaluated at the respective inter- 
mediate grid positions, in the center boundary of two adjacent grid cells (see 
Fig. 3.1b)). For example, the z component of the energy flux reads: 




















ð /1 1 
i =(=) ~L Di (=) 3.3 
Oa \T i,j,k o9 i+h,jk ~ \T/ li j,k (3:3) 
with 1 
Loolisa j,k = 5 Loos ie + Lool; 5.) 
and 
wÈ) -Gl -4 
PAT ligne Ax \Tli+i jk Tlijk J’ 


where D+ indicates the forward difference operator in x direction. As a next 
step, the divergence operation is carried out using left sided finite differences 
of the flux components in Eq. (3.3). The resulting spatial-temporal scheme 
has an accuracy of convergence of order (Ax)? in space and of order At in 
time. 


Phase-field equations: 
For the phase-field equation (Eq. (2.20)), the correct treatment of anisotropy 


in the gradient entropy density a(@,V@) is important, especially when re- 
ducing the numerical interface to a desirable low number of grid points. 
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Fig. 3.1. a) Schematical view of a two-phase region with diffuse interface (shaded) 
and generalized gradient vector qag. b) 2D sketch of the finite difference grid with 
the field variables $,c,T at central positions (0), q-vectors and with transport 
coefficients Lmn at staggered positions (x). 


It is convenient to keep the formulation using generalized gradient vectors 
dos = ¢aVOg — Vha (Fig. 3.1a)) in the solution algorithm. We take 
a(@, V@) in the form of Eqs. (2.3) and (2.4) and carry out the variational 
derivatives with respect to Va and ġa to get the two anisotropic rhs terms 
of the phase-field equation, namely: 








Aas (_ 
a, Vba = 5 (3.4) 

bža 
-5 EVs (3.5) 

TE 
wig, Aag ( dae, ». 9 

with =2 a6 | te(das)— |dael* + aldag) dag |. 3.6 
Dds Yap | acq P) gag (2 al (das) das (3.6) 


The entropy flux term a y, in Eq. (3.4) needs a special attention due to 
the influence of the anisotropy function a-(qag). Three different vectors qg, 
qe B and gig are computed for each grid point, evaluated at the upper side 
x, y and z boundaries of the cell volume. The spatial derivatives of ġa on 
these staggered grid positions include a combination of right-sided and cen- 
tral differences of the neighboured grid positions (see Fig. 3.1). To compute 
each spatial component of the vector valued function a yga, the respective 
vector q777 is used. The divergence of avg, appearing in Eq. (2.20) is cal- 
culated subsequently with left sided differences, taking advantage of the same 
3-layer buffer mechanism as described above. For the second anisotropic term, 
a $a, the gradient vectors qag are evaluated at central positions using central 
differences of ġa. 
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We find that by this treatment (equivalent to a 16-point stencil) a second 
order accuracy in space is obtained. For the 2D case the discretization is 
equivalent to the scheme published in [And00]. If no anisotropy is present, 
dag can be calculated exclusively with right-sided differences of ġa within a 
7-point stencil, in this case sufficient to guarantee second order accuracy. 


3.2 Optimization of the Computational Algorithm 


The phase-field model has been implemented in its complete generality for 
simulations in 2D and 3D describing phase transitions with energy and mass 
diffusion in alloy systems for an arbitrary number of components and phases. 
An extensive program package (PACES8D: Parallel Algorithms for Crystal 
Evolution in 3D) was developed in the programming language C due to perfor- 
mance reasons, but uses an object oriented approach; a data base is provided 
to choose different formulations of free energies, anisotropies, potentials, mass 
and heat diffusion functions. Moreover, artificially generated noise of vari- 
ous distributions can be added to the phase-field variable (non-conservative) 
or to energy or concentration fields (conservative noise). Several adaptive 
strategies reduce the computational effort: 


e The equations for the phase fields are exclusively solved in their respective 
diffuse interface area by the use of an activation flag field. This field is set 
in each time step by an optimized gradient test routine. 

e A dynamical memory concept reduces the memory costs especially for a 
high number of phase-field variables: For the majority of the grid points, 
da, assumes values of 1.0 or 0.0 in all regions of pure bulk phases and can 
be referenced by pointers to constant unit vectors. 


Since both strategies are especially effective with a narrow interface profile, 
we favorize the use of the multi-obstacle potential in Eq. (2.9). This potential 
has a strong separating character and reduces the number of interface grid 
points (with 0 < i jk < 1). 

For most alloy systems, there are great differences in the thermal and mass 
diffusivities leading to differences in the evolutional time scale. If the Eqs. 
(2.11) and (2.12) are solved conjointly, the stability criterion demands for the 
use of the small time step width which is usually the heat diffusion scale. 
To reduce the computational effort, different step widths, integer multiples of 
the smallest (heat diffusion) scale, are used to solve the three kinds of Eqs. 
(2.11),(2.12) and (2.13). 


3.3 Parallelization 


Two parallelization concepts of the finite difference algorithm are integrated 
in the source code and can be chosen aside or in combination: For high perfor- 
mance computing on Linux clusters, distributed computing is realized via 
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the (LAM-) Message Passing Interface (MPI) routines. The approach splits 
the three-dimensional simulation grid into multiple sub-grids, so that each 
available node gets assigned a specific part of the simulation space. To avoid 
unnecessary complexity of the code and to reduce boundary data exchange, 
the simulation is exclusively subdivided along the z-direction. In this way, we 
have minimal interdependencies between neigbouring simulation subdomains 
and a maximum speedup factor, a result of extensive software tests. By map- 
ping the number N, of grid points in z-direction to its value n, in each node 
subdomain, we are able to use the same code base as well as simulation de- 
scription for serial as well as distributed simulations, effectively decoupling 
the simulation from the hardware executing it. An MPI-based boundary ex- 
change mechanism ensures that before each simulation time step commences, 
the outer planes of each node are copied into the excess space, provided by 
its immediate neighbour. A more extensive description of the parallelization 
can be found in [NW*05] 

An additional approach to exploit the power of multiprocessor worksta- 
tions and supercomputers was realized by shared memory parallelization 
via OpenMP: the execution of the spacial loops is subdivided into different 
threads running on different processors of a single node. Therefore, only slight 
modifications of the code are necessary, e.g. the introduction of indexed loop 
variables. 

An important feature for parallel phase-field simulations is the implemen- 
tation of an appropriate load balancing mechanism. Due to the advancing 
fronts, which require more calculations, the demand for computational power 
is locally non-uniform. In our case, this was achieved by performing statistics 
of the load of each node and of the data transfer times. A redistribution of 
the simulation area after an optimal number of time steps is initiated. Addi- 
tionally, to adapt the code to networks with lower capabilities requiring data 
compression, we evaluate run-length and quadtree encoding of the exchanged 
boundary data. 





4 Applications 


To give an impression about the applicability of the model and simulation 
tools, a selection of different microstructures examined in more detail is given 
below. Here our focus lays on solidification processes in pure (Ni), in binary 
alloys (Ni/Cu) and on the description of grain growth phenomena. In the 
simulations, the complete binary coded data for temperature, concentrations 
and phase-field parameters is stored in floating point accuracy and analyzed 
a posteriori. To tackle the problem of 3D data analysis, a complete software 
toolbox was developed, including means for visualization, determination of 
front velocity, temperature and gradients, calculating contour lines, one and 
two dimensional intersection profiles and growth rates. 
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Fig. 4.1. a) Phase-field profile of a planar front growth in an undercooled melt (pure 
Ni) for high grid resolution (double well potential). b) Phase-field and temperature 
profiles with reduced resolution ensuring sufficient accuracy. 


4.1 Parameter Tests 


To verify the accuracy of the numerical method, different grid sizes and pa- 
rameter tests were carried out for the Ni and NiCu data sets. The predictions 
of the Gibbs-Thomson equation (Eq. (2.23)) for the growth of a planar front 
resp. a spherical shape with radius R were examined. For these simple cases, 
where the curvature « is equal to 0 resp. to 1/R, the tests gave the predicted 
growth velocities v by Eq. (2.23) very closely. Finite size effects of the numer- 
ical grid were verified over a long simulation period by evaluating the shape 
deformations of a spherical solid in equilibrium with the surrounding melt. 
With the lateral and diagonal radii Rio and Ri; from the simulation the grid 
anisotropy grid = Foon was evaluated. Grid anisotropy was found to be 
always much below the anisotropy strength 6 for sphere diameters greater 
than 20 grid points with the Ni dataset (Table 2.1). 1D simulations of planar 
front solidification with a diffusion field were used to optimize the values of 
the grid spacing Az and of the diffuse interface width € with respect to an 
agreement with the analytical solution. 

The calculated phase-field profile corresponds well with the expected 
$(1 — arctan(x — xo)/e) shape (Fig. 4.1a)) and the temperature profile fol- 
lows a erfc(x—2q) form (Fig. 4.1 b)) as expected from literature (cp. [Dav01]}). 
zo denotes the front location. As a result the diffuse interface is adequately 
resolved with about 5 to 8 grid points to produce correct results. 


4.2 Dendritic Growth 


The 2D and 3D simulations of dendritic solidification in pure Ni were accom- 
plished with the dimensionless dataset of Table 2.1. In the simulation runs, a 
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b) 














Fig. 4.2. a) A dendrite in a channel of width l with tip radius R and with a finger 
width A (J = 15 um); b) liquid finger growth for a narrow channel with l = 1 um. 


small amplitude of gaussian distributed noise was added to the front, strong 
enough to initiate the evolution of dendritic side arms. An example for this 
pattern in 3D, an equiaxial Ni dendrite, is given in Fig. 4.5c). 

Real alloy microstructures are determined by the competitive growth with 
a high number of initial dendrite tips, often starting from a planar configu- 
ration. A quick selection of the tip spacing due to the retarding effect of the 
emitted latent heat at the front follows, so that only a few dendritic fingers 
survive. 2D simulations to study the influence of the dendritic spacing l, of the 
undercooling A = cp(Tm — T)/L and of the crystal orientation were carried 
out. In order to mimic the situation of an array of dendrites, a single nucleus 
was confined in a long simulation domain with isolation boundary conditions 
at the bottom and periodic conditions at the long sides. The calculations were 
executed until the tip velocity approximated an asymptotic value, i.e a steady 
state was reached. The microscopic solvability theory predicts the operating 
conditions for a steady state dendrite tip with radius R (inversely proportional 
to the tip curvature) and tip velocity Vtip [BM91]. In Fig. 4.2 a) the geometric 
parameters of a channel dendrite: the tip radius R, the finger width À and 
the channel spacing l are shown. The tip radius was determined by fitting 
a parabola to the tip region, as indicated by small circles. In a first series 
the spacing was varied over more than one order of magnitude at a constant 
undercooling A = 0.4. The grid resolution therefore was adapted to avoid 
finite size effects for narrow channels. The relation of the finger width and of 
the channel spacing \/! shows a clear increase from an asymptotic value of 
A/L = 0.4 for large spacings to A/L = 0.85 at 1 = 1.0 wm (cp. Fig. 4.3a)), 
which can be explained with the increasing surface tension in a narrow chan- 
nel. This can also be seen in Fig. 4.3b), where the tip radius R deviates at the 
left side from the strictly logarithmic characteristics for wider channels. Also 
we observe for narrow channels an unsteady evolution of the tip and a viscious 
finger like behaviour of the interface shape (Fig. 4.2b)), in accordance with 
an earlier phase-field study on a model data set [SPGO1]. 

The orientation dependency, an important question when considering sit- 
uations with more than one nucleus, is essential for selection and overgrowth 
phenomena during grain formation processes in alloy systems. The effect on 
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Fig. 4.3. Variation of the dendritic spacing l. a) dendritic finger width À and 
fraction A/l, b) tip radius R (dashed line: value for an equiaxial dendrite). 


the growth velocity v for an increasing orientation angle between the den- 
drite and the long channel direction is given in Fig. 4.4a). The profile of the 
dendrite becomes asymmetric with side arms starting to evolve exclusively in 
the front direction. At an angle of more than 40° two dendritic tips start to 
compete, so that in a narrow channel a transition to cellular growth takes 
place (Fig. 4.4 b and c). 

Some examples for dendritic array simulations in 3D are shown in Fig. 4.5. 
A single dendrite in a 7.5 um channel (a) develops an approximately quadratic 
cross section. Chart b) presents an array of dendrites emerging from a rough 
front with a 15° inclined orientation with respect to the surface normal. Due 
to the compact growth pattern, only a minor side arm formation appears as 
compared to the equiaxed single nucleus in chart c) (domain: 15 um). 

For comparison, the case of solutal growth in a Nio.59Cuo.61 melt with 
an isothermal undercooling of 20 K below solidus is shown in Fig. 4.6. The 
dendrite evolves with a pronounced side arm formation. The dependence of 


a) b) c) 





| 


Fig. 4.4. Variation of the orientation angle ¢. a) tip Vtip and front velocity vfront = 
Vtip cos(@). Contour plots of growing dendrites with an orientation of 45° in a narrow 
channel (b)) and in a wide domain (c)). 
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Fig. 4.5. 3D Ni dendritic growth at an undercooling of A = 0.6. a) single chan- 
nel dendrite, b) dendritic array with an orientation inclination of 15° with rsp. to 
normal, c) equiaxial dendrite. (See page 685 for a colored version of the figure.) 


the tip velocity on the channel width is negligible when exceeding a critical 
width, since the solutal diffusion coefficient is three to four orders of magnitude 
lower than the thermal coefficient in pure Ni. However, for NiCu a strong 
dependence on the growth orientation ¢ (anisotropy direction) was found. The 
plot of the tip velocity Vtip in Fig. 4.6 b) indicates the occurance of a minimum 
at an angle of 15°, whereas the actual front velocity vfront = Vtip cos(@) shows 
only minor changes within a range of 15° to 35°. 
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Fig. 4.6. Channel growth for the Nio.59Cuo.61 system. a) Single dendrite arm (unro- 
tated) with Ni concentration map. b) Dependancy on orientation: tip Vtip and front 
velocity Ufront = Vtip cos(@). 
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4.3 Eutectic Growth 


In the following, microstructure simulations of binary and ternary phase trans- 
formation processes are shown to illustrate the wide variety of realistic growth 
structures and morphologies in multicomponent multiphase systems that can 
be described and investigated by the phase-field model. Fields of applications 
are eutectic grain boundary formations and structure evolutions in ternary 
systems [RMC03] as well as eutectic colony growth involving ternary impu- 
rity effects [AF00] which will be shown by the following examples. 

To perform the simulations in Figs. 4.7-4.13, we considered a ternary eu- 
tectic model alloy of three components A, B and C (i = 1,...,3), three solid 
phases a, 3, y (a = 1,...,3) and one liquid phase L (œa = 4). We non- 
dimensionalized the model equations (Eqs. (2.11-2.13)) and, for initialization 
of the computations, chose the following parameter set: Equal grid spacings 
for the two/three coordinates at a value Ax = 0.01, a diffuse interface thick- 
ness € = 0.05, surface entropy densities yg = 0.001, an isotropic kinetic 


a 


ses 


completely symmetric phase diagram with dimensionless data for the latent 
heats of fusion L% and for the melting temperatures T“: 


1.47 1.00 1.00 0.00 
(LS) i=1,...2 = | 1.00 1.47 1.00 0.00 (4.1) 
1.00 1.00 1.47 0.00 


1.50 0.50 0.50 0.00 
a) 1.3 = | 0.50 1.50 0.50 0.00 | , (4.2) 
0.50 0.50 1.50 0.00 


where a = 4 is assumed to be the liquid phase L. As a result, the phase 
fractions of the three solid phases at the ternary eutectic temperature are 
equal. Further, we considered the solidification process under the condition of 
isothermally undercooled melts. 

In Fig. 4.7, the formation of two eutectic grains in the binary A— B ’edge’ 
system of initial composition (c4, cg, cc) = (0.5,0.5,0.0) has been simulated 
in a 2D domain of 270 x 540 grid points. The simulation involves pattern 
formation on different length scales. On a larger scale, grains with different 
orientations due to anisotropy of the surface entropy densities Yag grow and 
form a eutectic grain boundary. To include anisotropic effects, we used the 
facetted formulation of Eq. (2.6) for a cubic crystal symmetry and defined 
two sets of four corners for the upper and for the lower grain, whereas the 
corners of the lower grain are rotated by 10° with respect to the growth di- 
rection. On a smaller scale, a lamellar eutectic substructure solidifies: Below a 
critical eutectic temperature Te (here Te = 1.0), a parent liquid phase L trans- 
forms into two solid phases a and ĝ in a binary eutectic reaction: L —> a+ p. 
The white and light grey colored regions as well as the black and dark grey 
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Fig. 4.7. Growth of two eutectic grains (white/black and light/dark-grey) of a 
binary A-B alloy with different crystal orientations into an isothermally undercooled 
melt (continuous grey scale) at four time steps. 


colored regions represent the same solid phases, namely a and 8, with just 
a different orientation. The results illustrates the capability of the model to 
distinguish several phases and grains at the same time. The images visualize 
the phase evolution and the concentration profile of the alloy component B in 
the liquid ahead of the growing solid phases at different time steps. Concen- 
tration depleted zones occur in dark grey and concentration enriched zones 
appear in light grey. 

Fig. 4.8 shows in a) 2D and in b) 3D simulations results of regular eu- 
tectic lamellae growing from an undercooled melt with initial composition 
(ca, CB, Cc) = (0.5, 0.5, 0.0). In a symmetric phase system, the two solid phases 
grow with equal phase fractions. At off-eutectic composition c = 0.4, the vol- 
ume fractions of the two solid phases are different. The proportions of the 
phase fractions in our simulations can well be related to the classical lever 
rule. Next, we investigate the widely observed phenomenon of regular oscil- 
lations along the solid-solid interface driven by the motion of the triple junc- 
tions, see Figs. 4.9 and 4.10. A transition from stable lamellar growth to an 
oscillatory structure is found for varying initial phase fractions at the eutectic 
composition. A characteristic amplitude and wave length of the oscillation is 
established during solidification. The 3D microstructure performs an alter- 
nating topological change from a-solid rods embedded in a -solid matrix to 
(-solid rods embedded in an a solid matrix and so on. 

Depending on the position in the phase diagram, ternary alloy solidifica- 
tion may involve phase changes of four different phases and diffusion of three 
alloy components A, B and C. At the ternary eutectic composition, three solid 
phases grow into an undercooled melt via the reaction L — a+6+/7. While si- 
multaneously growing, the solid phases mutually enhance each other’s growth 
conditions as they reject opposite components of the alloy into the liquid. We 
have set an equal initial composition vector of (c4, cg,cc) = (0.3, 0.3, 0.3). 
For isotropic phases, this leads to very regular lamellar structures as those in 
Fig. 4.11a) and b). The three images in a) display the concentration fields of 
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Fig. 4.8. Establishment of regular lamellar solidification at the eutectic composition 
in 2D (a) and 3D (b). (See page 685 for a colored version of the figure.) 
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Fig. 4.9. Regular oscillations along the solid-solid interface driven by the motion 


of the triple junction/triple line in 2D. (See page 685 for a colored version of the 
figure.) 


the three components A, B and C in front of the growing eutectic lamellae with 
a phase sequence of a|G|a|y|a|... at the same intermediate time step. It can 
be observed that the white aœ phase consumes component A from the melt and 
pushes components B and C into the melt. The respective process happens for 
the two other solids 8 and y. For comparison of the diffusion fields, Fig. 4.11 b) 
shows the concentration of C for a phase sequence a|3|y|a|... By performing 
phase-field simulations, the stability of different phase sequences for varying 
solidification conditions can be investigated. The diffusion processes of the 
three components are illustrated in a 2D domain of size 200 x 200. 

Fig. 4.12 shows a time sequence of a 3D simulation of ternary eutectic 
solidification in a computational domain 60 x 90 x 90. The computation was 
initialized with cubic crystal shapes. During the evolution, a regular hexagonal 
structure of the three isotropic solid phases with 120° angles between the solid 
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Fig. 4.10. Topological change of the microstructure due to oscillations along the 
solid-solid interface in 3D. (See page 686 for a colored version of the figure.) 


212131 3 


Fig. 4.11. a) Concentration fields c4,cg and cc of a ternary eutectic lamellar 
solidification front with a solid phase configuration (a|G|a|y7| ...). b) Concentration 
field cc for a phase sequence (a|f|7| ...). 











Fig. 4.12. Formation of a 3D hexagonal rod-like structure in a ternary eutectic 
system with isotropic surface energies and three different solid phases a, and y. 
(See page 686 for a colored version of the figure.) 


phases is established as steady growth configuration in 3D in analogy to the 
lamellar structure in 2D. This symmetry breaks if anisotropy is included. 
The simulation in Fig. 4.13 was conducted with an initial composition vec- 
tor of (c4, cB, cc) = (0.47, 0.47, 0.06) so that the concentration component cc 
acts as a ternary impurity of minor amount. As can be seen in the first two 
images, the solid phase a in white color is formed by using up the concentra- 
tion c4 whereas solid phase 8 rejects A atoms. If a y solid phase containing 
cc as its major composition is introduced, it is instable and immediately dis- 
solves for these concentration proportions. Neither the a phase nor the 8 
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Fig. 4.13. Simulation of lamellar eutectic growth in a ternary system with an 
impurity component cc: The concentration profile of the main component ca in melt 
is shown in the left and centered images for two time steps. The ternary impurity cc 
is pushed ahead of the growing eutectic front so that concentration enriched zones 
of component cc can be observed at the solid-liquid interface in the right image. 


phase engulfs the concentration cc so that it increases all along the solid- 
liquid interface. The simulated evolution process recovers the experimentally 
observed effect that the impurity becomes enriched ahead of the solidifying 
lamellae and builds up. At larger computational domains, we expect the effect 
of cell/colony formation to occur. 


4.4 From Solidification to Grain Coarsening 


A problem with a high relevance to technical applications is the evolution of 
polycrystalline alloy microstructures consisting of many grains with individ- 
ual orientations. Solidification as well as grain boundary and concentration 
reorganization are successive steps in casting respectively heat treatment. 
The influence of crystal anisotropy is very important when examining non- 
equilibrium structures such as dendrites or the interaction of different crystal 
grains in a polycrystal. In our model, we treat each grain as an individual 
phase with its own orientation, given by three angles of rotation with respect 
to the coordinate axes. This approach is not the computationally most effec- 
tive one as compared to two-phase models with an additional misorientation 
variable (cp. [GP*04]), but offers benefits with respect to accuracy and pa- 
rameter choice: if the angular dependence of the surface free energy is given, 
correct surface energy contributions are incorporated and no additional un- 
known model parameters, such as the mobility of reorientation, are necessary. 
Concerning the numerical realisation in the model with N phases, an N x N 
matrix is automatically generated at simulation start and keeps the informa- 
tion of the orientation relations of each a—( interface. More precisely, a special 
anisotropy is not related to a phase with a special orientation, but rather to a 
phase boundary. This has the great advantage, that each a-8 phase boundary 
can be related to an individual surface energy as well as to an inclination 
dependence. Both parameters: the misorientation of two grains and the in- 
clination of the grain boundary with respect to a coordinate system fixed in 
one of the grains, affect the surface energy (see Kazaryan et. al. [KW*00]). In 
the phase-field equation, only two additional matrix multiplications, one for 
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Fig. 4.14. Treatment of multi-phase anisotropy: For three solid phases a, @ and y, 
the anisotropy with respect to the liquid as well to each solid phase must be defined. 
Numerically, the qag vectors are rotated by the misorientation angle 02 — 0g and 
then used in the phase-field equation. 


rotating the qag-vector and another for back-rotating the resulting entropy 
flux vector a,v¢, (Eq. (3.4)) enter the program sequence. Fig. 4.14 gives an 
overview of this relationship. 

For the simulations presented in the following, the NiCu data set from 
Sect. 4.2 with identical initial conditions (melt composition: Nio.s9Cu41, 20 
K below solidus) were taken. First, the selection of dendritic grains with 10 





Fig. 4.15. Selection process in a polycrystalline dendritic front: The colours in a) 
and b) indicate the orientations of the dendrites for two different time steps, whereas 
in c) the Ni concentration is shown. (See page 686 for a colored version of the figure.) 
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Fig. 4.16. Three time steps of two misaligned NiCu grains in 3D starting from a 
rough planar initial state (lateral periodic boundary conditions). (See page 687 for 
a colored version of the figure.) 


different orientation orientations over the range of [—45°, +45°] starting from 
the bottom wall with periodic horizontal boundary conditions is depicted in 
Fig. 4.15. A rapid overgrowth of the more vertically aligned orientations takes 
place. The result is a nearly 45° misaligned growth direction of two surviving 
dendrite branches. A similar process with two differently oriented, initially 
planar and rough grains is shown in Fig. 4.16, where the grain with a stronger 
inclination with respect to the vertical axis (lighter grey) overgrows the second. 

As a final example, equiaxial grain growth processes for Nip.59Cuo,41 poly- 
crystals and for a random initial distribution of nuclei are discussed. The evo- 
lution of the grain structure combines the growth of many differently aligned 
crystal seeds into the same melt reservoir and the final interaction of the fully 
solidified individual grains. In Fig. 4.17, three time steps of the solidification 
process in 2D are shown on a computational domain of 800 x 800 grid points. 
The release of Cu in front of the interface slows the phase transformation 
down. The fully solidified crystals show a low Ni concentration at the grain 
boundaries, frozen in due to the very slow solid-solid mass diffusion. The ana- 
log situation in 3D is given in Fig. 4.18a) and b) with about 30 separated 
grains. Here the level sets a = 0.5 are displayed for 9 grains. 

When raising the temperature of this grain structure in a subsequent pro- 
cess (Fig. 4.18c)), but keeping it below the solidus temperature of the equi- 
librium composition, the structure starts to melt along the edges separating 
three or more grains. When continuing this processing, about 30 % of the 
grains shrink and vanish, whereas the rest grows and the remolten liquid 
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Fig. 4.17. Growth of dendritic NiCu grains into a 20 K undercooled melt illustrated 
by the Ni concentration (range: 0.41—0.62). The complete solidification (right image) 
is reached after further reducing the temperature by 15 K in a second step. (See page 
687 for a colored version of the figure.) 


fraction gradually disappears. A reduction of the interface curvature and a 
minimization of grain boundary energy is the driving force of the process. 

In addition to the visualization, the diffuse phase-field data can be used 
for various analysis purposes of the material properties. An application useful 
for the extraction of morphological information which is motivated by crys- 
tallographic texture analysis is presented. The surfaces of a grain in contact 
with neighbours of different orientation evolve due to local curvature, kinetic 
and surface energy anisotropy. Since the surface normals can be computed by 
Vda/|bal, a stereographic projection on the equatorial plane reveals infor- 
mation about prefered faces or shape distributions. In Fig. 4.19 an originally 
spherical grain in a matrix of a second phase with strong cubic anisotropy 
develops an octahedral shape with distinct faces. This corresponds to the for- 
mation of maxima (bright spots in Fig. 4.19a)-c)) at these directions from 
an originally homogeneous distribution. The method can also be useful to 
get average values on a complete set of grains, if one backrotates the surface 





Fig. 4.18. a) and b): Growth of a polycrystalline NiCu structure with 30 grains 
on a domain of size 100 x 100 x 100. The isosurfaces of selected grains for two time 
steps are displayed. c): Heat treatment with partial melting along the grain vertices 
(from [WN06]). (See page 688 for a colored version of the figure.) 


144 B. Nestler, F. Wendler 


Fig. 4.19. Analysis of surface orientations for an evolving grain (top) using a stere- 
ographic projection of the surface normals for three time steps in a)—c). A spherical 
grain turns into an facetted shape with distinct octahedral faces due to surface 
energy anisotropy (triangle spots on diagonals in image c)). 





normals according to the respective grain orientation before performing the 
stereographic projection. 


4.5 Grain Structures in Geological Materials 


As shown in the previous subsection, a numerical scheme which efficiently han- 
dles a large number of phase-field parameters can be used to simulate grain 
growth. We applied our model to two basic geological topics in microstructure 
evolution: first, grain growth with melt inclusions and second, the crack-seal 
mechanism in rock veins. Both processes are characterized by a strong inter- 
dependence between triple point movements on a small length scale and the 
resulting large-scale grain morphology. 


Geological Microstructure Evolution with Melt Inclusions 


Rock formation takes place in deep inner layers of the earth’s crust under high 
temperatures and pressures, where microscopic rock grains are in equilibrium 
with a low fraction of melt. Fig. 4.20 shows a thin section of an experimental 
model grain system, where some special local geometries are marked: fluid 
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Fig. 4.20. Thin section of an experimental geological model system with grains and 
intermediate melt pockets (from [BB05]). Typical morphologies of liquid inclusions 
at grain boundaries and triple junctions are marked (see text). 


boundaries (1), grain boundaries with melt lenses (2), edge shaped melt in- 
clusions (3), ’dry’ triple junctions (4) and large melt pools (5). 

The grain structure is expected to be porous with a high permeability: the 
melt inclusions are thought to be connected in the third (hidden) dimension 
allowing for an unlimited exchange of liquid between them. For the simula- 
tions, the driving forces fọ,(T,c, @) for the phase transition in the phase-field 
Eq. (2.20) were set to zero. Since the total amount of liquid phase is con- 
served, a new mechanism for the preservation of phase volume was realized 
in the framework of the multi-phase-field model. Volume preservation of an 
individual phase a in the domain 2 requires 


[0 = const, > a f a =0. 
Q Q 


This can be numerically fulfilled by a redistribution of the changed phase 
volume after each time step at the diffuse interface, weighted with the local 
kinetic coefficient 7(¢, Vo). To add the missing phase volume fraction exclu- 
sively to the interface area, the first derivative of the interpolation function 
h(@) is used, which differs from 0 only for 0 < ¢ < 1. The nonlocal correction 
term produced in this way was shown to be equivalent to an additional driving 
force —Hah( pa) on the right hand side of Eq. (2.20), where Ha represents the 
strength of this artificial force [WSN06]. 

2D Simulations were carried out for a various number of solid grains and 
one fluid phase with preserved volume (see [WZ*06]). To exclude the influence 
of the domain boundaries, periodic boundary conditions were applied. In a first 
step, wetting angles and local dynamics of the fluid inclusions were examined. 
Therefore, a special periodic geometry consisting of 4 hexagonal grains in Fig. 
4.21 was designed. This configuration reveals 120° angles between the grains 
and proved to be stable under calculation for identical solid-solid and solid- 
liquid surface energies, Ys, = Ys:. Melt inclusions were inserted for different 
ratio of the surface energies 755 /7s: and at different positions along the grain 
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Fig. 4.21. Periodic test geometry with an inserted melt inclusion (light grey) at 
the horizontal boundary, shown for three time steps (from left to right). (See page 
688 for a colored version of the figure.) 


boundaries. Fig. 4.21 shows the evolution of a square shaped inclusion, which 
slowly moves into the nearby triple point. At this location, the total curva- 
ture is minimized and the fluid phase reveals stable angles according to the 
relationship 2cos(6/2) = Yss/Ys1 [BWT79]. If the liquid phase is distributed 
at different positions along a grain boundary, a coarsening process starts in 
the course of which grains with higher local curvature shrink faster until only 
one melt pocket survives. This is also initiated for identical shaped inclusions 
due to a small amount of artificial noise. On a larger scale, the evolution of 
grain ensembles was studied. Using a Voronoi partitioning algorithm, a ran- 
dom grain structure was produced and circular melt inclusions were placed 
at the triple points. Besides the grain growth, a coarsening also of the liquid 
pockets occurs associated with an enlargement of their average diameter. In 
Fig. 4.22, two steps of the evolution can be seen: Within a small time period, 
the melt inclusions at the triple points shrink away and on a comparatively 
longer time scale the inclusions adjacent to four or more grain phases (quadru- 
ple points, right picture) perform a ripening process. This is in agreement with 





Fig. 4.22. Coarsening of a grain structure with fluid inclusions (light grey) at the 
triple junctions for three time steps and for a ratio of the surface energies Yss/7/s1 = 
1.2. (See page 688 for a colored version of the figure.) 
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the Neumann-Mullins law which anticipates an area shrinkage rate 4 A(t) of 
4 A(t) x (n — 6), with n being the number of edges of the evolving grain 
[Mul56]. Finally most of the melt is collected in the central hexagonal area. 


The Crack-Seal Mechanism in Rock Vein Formation 


Rock veins are clearly observable separated regions in a rock matrix consisting 
of a different polycrystalline mineral. They were formed in solutal growth from 
a hydrothermal solution which infiltrates a macroscopic crack in the rock. In 
the veins, a various number of block shaped or fibroid textures were found, 
an example is shown in Fig. 4.23a). Essential for the formation of these elon- 
gated fibrous crystals is a repeated opening of the crack and a resealing with 
the mineral phase. In the resulting morphology, the history of the process is 
stored, which takes place on a long geological time scale, whereas the seal- 
ing of an individual crack opening happens quickly. For the 2D simulations, 
inner boundaries were introduced to represent the outline of the crack (see 
[NSW06]). As shown in the scheme in Fig. 4.23 b) the rock matrix is modeled 
via two barrier areas on top and bottom, not linked to memory and con- 
taining no phase-field information. The barrier cells serve as markers for the 
boundary cells next to the gap in between, which contains the liquid phase. 
As for the outer domain boundary, any boundary condition can be chosen at 
the barrier walls. To initiate the polycrystalline growth process, a large num- 
ber of seeds of 10 different orientations is distributed along the upper crack 
boundary. After the complete solidification of the liquid gap, the lower part of 
the barrier is shifted a definite number of cells in y- and x-direction to model 
a crack opening combined with a shear. At the x borders of the simulation 
domain, periodic conditions were assumed. The additional non-barrier cells 


a) b) 
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Fig. 4.23. a) Fibroid structure in a rock vein (taken from [Hil00]). b) Schematic 
view of the simulation setup consisting of the rock matrix (barrier cells), of the inner 
boundary, solid starting grains and the liquid gap area. 
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in the simulation grid are initilized as liquid phase. This process is repeated 
until the lower boundary of the simulation domain is reached. For the first re- 
sults we desist from introducing a diffusion field or treating the advective flow 
into the opened crack. A driving force is established only between the solid 
phases with different orientations and the liquid phase. To amplify the role of 
the solid-liquid phase transition, solid-solid kinetic coefficients were chosen as 
72, = 107% (cp. Eq. (2.21)). Various anisotropies and crack geometries were 
studied. As an example, three time steps of two different, randomly initiated 
simulation runs are depicted in Fig. 4.24 for facetted anisotropies (Eqn. 2.6) 
of the grains. Contrary to the case of weaker anisotropies, always grain orien- 
tations with optimal alignment along the opening-shear direction of the crack 
survive the selection process (dark and light grey regions). The morphology of 
the resulting grain boundaries is dominated by the geometry and the opening 
and shear rate of the crack. The dependence of the growth morphology on the 
shear rate (the horizontal crack movement) is demonstrated in Fig. (4.25a) 
and b)) for two simulation runs. A cubic surface energy anisotropy in the form 
of Eqn. (2.5) with sı = 0.08 was chosen. In the case of a vertical opening, 
also grains with strongly misaligned growth directions (medium grey) tend to 
be selected and form vertical grain boundaries. An increased horizontal shear 
leads to a wave-like pattern with increasing frequency and a selection of the 
optimal growth directions (dark and light grey). 





Fig. 4.24. Polycrystalline grain growth in a crack-seal process with facetted 
anisotropy of the surface energies of the grain boundaries. a) and b) show three 
time steps of two simulation runs with different starting grain distribution. (See 
page 689 for a colored version of the figure.) 
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Fig. 4.25. Effect of different shear rates on the resulting morpholgy. From left to 
right: no shear, Ax = 10 cells, Ax = 20 cells. Two simulation runs a) and b) with 
different grain distributions are displayed. (See page 689 for a colored version of the 
figure.) 


5 Conclusion 


A phase-field model for multiphase solidification has been formulated in a gen- 
eral and thermodynamically consistent way. Explicit expressions for different 
energy density contributions are given, which are appropriate to the specific 
physical process. Multicomponent diffusion in the bulk phases including inter- 
diffusion coefficients as well as diffusion in the interfacial regions are discussed. 
Anisotropy of both, the surface energies and the kinetic coefficients is incorpo- 
rated in the model formulation. Simulation results of dendritic, eutectic and 
grain growth show the capability of the model to describe phase transitions 
and complex multiscale microstructure formation. Building upon results on 
modelling rapid solidification [AW*98, Gal01] and on deriving a thin interface 
analysis [KR96, A199, KKS99, MWAO0, Kav01], challenges for future devel- 
opments are the extension of the presented model to strong nonequilibrium 
solidification and to small undercooling situations. The effective numerical 
multi-phase algorithm can be utilized in a straight way to incorporate fluid 
flow (see [SNW05]) and elasticity (in preparation). 
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1 Introduction 


In this article we review recent attempts to understand the interaction of dif- 
ferent length and time scales in phase separating systems with elastic misfit. 
Phase separation occurs for example if an alloy is quenched below a critical 
temperature, where a homogeneous mixture of the alloy components is not 
stable. The early stage of the separation process, where different phases, char- 
acterized by the respective concentrations of the alloy components, appear is 
called spinodal decomposition. 

The Cahn-Hilliard model [CH58] and its extension with elasticity, the 
Cahn-Larché model [CL82, CL73], have originally been introduced to model 
spinodal decomposition. Later numerical simulations (see e.g. [E1189]) and 
formally matched asymptotic expansions (see [Pe89]) showed that the Cahn- 
Hilliard equation can also describe a process on a slower intermediate time 
scale in which the regions occupied by the phases rearrange in order to de- 
crease their free energy. In the case that elastic contributions can be neglected, 
the free energy is essentially given by the surface energy and the evolution 
leads to nearly spherical disjoint components, called particles (see Fig. 1.1). 
If anisotropic elastic effects are present the shapes resemble the anisotropy of 
the elastic energy (see Fig. 1.2). 

In the late stage, when the system has already minimized its energy lo- 
cally, interactions between particles become important. In the case that no 
elastic energy is relevant, small particles shrink, while larger ones grow, a 
coarsening process known as Ostwald Ripening. The influence of elastic in- 
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Fig. 1.1. Evolution starting from a perturbation of a uniform state. (See page 690 
for a colored version of the figure.) 





Fig. 1.2. Alignment of interfaces driven by homogeneous, anisotropic elasticity. (See 
page 690 for a colored version of the figure.) 


teractions, e.g. through an elastic misfit due to different lattice constants, 
can drastically influence the coarsening process. The shape of the particles 
changes from spherical to cuboidal or plate shape, particles can align or even 
split. In particular on the large time scale the elastic energy which scales like 
a volume becomes comparable to the surface energy and it might be possible 
to stabilize the coarsening process (“inverse coarsening” ). For a review on the 
modelling of phase separation in alloys with elastic misfit we refer to [FPL99]. 
To model the late stage regime, often so called sharp interface models are 
used, which also appear as singular limits of the Cahn-Hilliard equation (see 
[GK06]). In contrast to the latter, the boundary between different phases is 
given by a hypersurface. 

In this overview we will discuss both the Cahn-Hilliard equation with 
elasticity (the Cahn-Larché system) and a Mullins-Sekerka type model with 
anisotropic and inhomogeneous elasticity. Although we will also discuss some 
aspects of modelling and mathematical analysis, our main focus will be on 
computational aspects. 

First, we will introduce the governing models and their interpretation as a 
gradient flow in Sect. 2. The latter will be relevant for the set up of a reduced 
model to simulate large particle ensembles. In Sect. 3 we study the effect 
of elastic contributions on spinodal decomposition within the Cahn-Larché 
model. In Sect. 4 we will explain how the Cahn-Larché system can be solved 
efficiently and present computational results for the Cahn-Hilliard equation 
and the Cahn-Larché system. In Sect. 5 we study coarsening rates for a large 
system of particles. Here we observe a transient coarsening behaviour in the 
Cahn-Hilliard model without elasticity and we will also see effects of elasticity 
on the ripening process. 
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In Sect. 6 we introduce a boundary integral formulation of the Mullins- 
Sekerka evolution and a corresponding boundary integral method. Simulations 
for systems with a few particles will be presented, which in particular show 
typical particle shapes and display when a certain pattern such as alignment of 
particles appears. Finally, in Sect. 7, we will use the gradient-flow perspective 
for the Mullins-Sekerka evolution to derive a reduced model, in which particle 
shapes are extremely simple. With this approach we can efficiently simulate 
larger particle systems. 


2 The models 


2.1 The Cahn-Larché model 


We consider the case of a binary alloy, i.e. two alloy components are present 
with concentrations cı and cy. We choose the concentration difference c = 
Cı — Cg as variable which due to the constraint cı + co = 1 determines the 
concentrations. The deformation field is denoted by u and since we consider 
models that are based on linearized elasticity we introduce the linearized strain 
tensor 


1 
e(F):= a(F + FT), with F = Vu. 


The free energy of the system is then given by 
geaz j {2iveP + Yle) + W(c, Vu)} de (2.1) 
Q 


where 2 C R? is a bounded domain, y > 0 is a small interfacial parameter, 
w:R-— Ris the non-convex free energy density and W : R x R4*4 — R is 
the elastic energy density. A homogeneous free energy density Y for a mean 
field model at a fixed absolute temperature is 


vc) = £2 {(1+¢)n(1+¢)+(1—¢)In(1—c)} + 1 —c’). (2.2) 


Here 6, is the critical temperature and R is the gas constant scaled by the 
(constant) molar volume. For @ below the critical temperature ĝe the energy 
density ~ has two global minima c_, c} and hence a non-convex form. For 
shallow quenches, i.e. 0 < 0 < @ one usually takes a smooth approximation 
o (2.2) of the form 


w(c) = (ce? —a”)?, O<a<1, b>0. (2.3) 


As elastic energy density W we take a quadratic function in the strain 
tensor £ and set 


W (c, Vu) = 5 (Vu) —&(c)) : C(c)(e(Vu) — E(c)). 


156 H. Garcke, M. Lenz, B. Niethammer, M. Rumpf, U. Weikard 


Here =(c) is the symmetric misfit strain (also called eigenstrain), C(c) is the 
fourth rank elasticity tensor and A : B := tr(A’B) for linear mappings A 
and B. As the elasticity tensor C is assumed to be symmetric and positive 
definite we obtain that =(c) is the energetically favourable and hence stress 
free strain at concentration c. Typically € is affine linear, i.e. 


zle) =e'+e*e 


where e!,e* € R?*4 are symmetric. We allow for an elasticity tensor that 
can be different for the two components and hence C can depend on the 
concentration c. 

For an isotropic material we obtain 


C(c)e = 2u(c)e + A(c) tr(e)Id 


where the Lamé moduli u and A depend on the concentration c. 
For a material with cubic symmetry we have 


C(c)e = 2u(c)e + A(c) tr eld + p'(c)diag e 


where diage is the matrix that one obtains, if all off-diagonal entries are set 
to zero. In general C is an arbitrary fourth rank tensor C(c) = (Ci; iz (e)) 
and using the symmetry conditions 


Cig ug = Cig yi = Cyan = Cog ag 


one can compute that for d = 3 there are 21 degrees of freedom in C which 
of course in general will be restricted by crystal symmetry. 

For example in a cubic system we obtain that C1111 = C2222 = C3333, 
Giijà = Cairn (for i,j, K mutually different), C2323 = C3131 = C1212 and all 
other entries in C either follow from the above by symmetry or they are zero. 
Sometimes a fourth rank tensor in R? is denoted by C;; (Voigt notation). In 
this case the indices 7,7 take values 1,2, 3,4,5,6 and they stand for the pairs 
11, 22, 33, 23, 31, 12 in the original notation. This means in a cubic system we 
only need to specify C11, C12 and C44. All other parameters are determined 
by symmetry. For a discussion of other symmetry classes we refer to Gurtin 
[Gu72]. We will also always assume that C(c) is positive definite and bounded 
uniformly in c. 

Taking mechanical effects in the Cahn-Hilliard model into account we ob- 
tain the system 


se ads (2.4) 
es a = —yAc + y (c) +Wele, Vu); (2.5) 
0= ae =-V.- W r(c,e(Vu)), (2.6) 


~ du 
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which we sometimes also call the Cahn-Larché system (see [CL82, CL73]). 
Here $B denotes the first variation of E with respect to c and We is the 
partial derivative with respect to c (the same notation holds with respect 
to u). We remark that for simplicity in (2.4) the mobility is taken to be 1. 
The chemical potential w is the diffusion potential and is given by the first 
variation of energy with respect to concentration. The quantity S = W p with 
F = Vu is the stress and hence (2.6) are the mechanical equilibrium equations 
from the theory of elasticity. 

The set of equations then has to be completed by appropriate boundary 
conditions which can be e.g. periodic boundary conditions or Neumann bound- 
ary conditions for w and c and a prescribed normal stress at the boundary for 
the u-equation. 


2.2 The Cahn-Larché system as a gradient flow 


The Cahn-Larché system can be viewed as a gradient flow. A gradient flow 
is the flow in the direction of steepest descent in an energy landscape. This 
framework requires a differentiable manifold M, and a vector field f, which 
attaches a tangent vector f(x) € TyM to every point x € M. The vector 
field f defines a dynamical system « = f(x). A gradient flow is a dynamical 
system where f is the negative gradient —grad€ of a function € on M. The 
notion of a gradient requires a Riemannian structure, that is, a metric tensor 
g on M. Then, the precise formulation of « = —grad Ez is 


ge (&(t), y) + (diffEn(),y) =0 for all y E€ T,(4)M and for allt. (2.7) 
If we choose y = (t) we observe that the value of € decreases along trajecto- 
ries. 

We now give two possibilities to view the Cahn-Larché system as a gradient 
flow. First we choose 


m= fenor] f cdr=f each, 
Q Q 


where co : N — R is the concentration at time zero. The tangent space is 


then given as 
TM := {vi 2-R| f var =o} 
R 


and the metric tensor on ToM is given by the H7! norm, that is 
gc(v, 0) = | V itv + Vua dx 
Q 


where py (respectively ug) has mean value zero and fulfills 


[Yu vede= f vede for all € € HHQ). 
R R 
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We remark that 
gc(v, 0) =| [yt dx. 
R 


In what follows we will write p, = (—A)~!v. 


We define 
E(c) = [ Give? + v0} de + min f W(c, Vu) dx (2.8) 
and claim that 
(diff E(c), 0) = f {-yAc + y (c) + W.-(c, Vuc) }0 dz 
2 


where ue solves (2.6) for given c. It should be remarked that the last term 
in (2.8) can be written as fp W(c, Vu-) dx which means that also u depends 
on c. Since (2.6) holds it can be computed that this dependence gives no 
contribution to the differential. We now obtain that 


(diffE, 0) = ge(Orc, 0) = | (4) taci dx 
Q 


is equivalent to (2.4)—(2.5) if we set w := (—A)~! Oc. 
Another gradient flow perspective for the Cahn-Larché system uses the 
energy (2.1) and uses the manifold 


M:= fe u): 2>RxR? | f c= j co + boundary conditions for u} 
Q Q 


with a corresponding tangent space T(,,,)M. The metric tensor is then chosen 
to be degenerate with respect to u. In fact we choose 


He,u)((v, w), (0,W)) = f Villy: Vua dx 


with py and uz as above. 
We remark that the gradient flow property has been used in Garcke 
[Ga03b] to show existence of solutions to the Cahn-Larché system. 


2.3 The Mullins-Sekerka evolution 


In the Mullins-Sekerka model the interface between two phases is described 
by the boundary {x = 1}, where x is the characteristic function of one of 
the phases. We restrict our presentation to the case 2 = R4. 

The evolution is driven by the reduction of an energy, which is given by 


eowuli= f vxlt f W(x Vu) de, 
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where fra |Vx| denotes the perimeter of the set {x = 0} in R“. That is, the 
energy is the sum of interfacial area, which is due to surface tension, and an 
elastic part, which depends on x and the deformation field u. In the following 
we consider linearized elasticity, that is we take 


W(x, F) :=xW1 (F) + (1 — x)Wo(F), 
1 
Wal F) :=30 (eE) — Za) : (e(F) — Za), 
in particular, we allow as above that the elasticity tensor is anisotropic as 
well as inhomogeneous, i.e. different in each phase; and we allow for a misfit 
between the two phases, €; Æ €o. The misfit may also be anisotropic, i.e. it is 
not necessarily a multiple of the identity. 


The evolution of the interface is driven by the gradient of the chemical 
potential u, that is the normal velocity v is given by 


v = [ð u] on the interface I := {x = 1}, (2.9) 
where v is the outer normal on {x = 1}, and 


Oy pt) := li Ov LL — li Oy 
VOUT e OE 2 o 


denotes the jump of the normal component of the gradient across the interface. 
The chemical potential u is determined for each time t via 


—Au=0 in the bulk R4\F, (2.10) 
w= K+v-[E(u)ly on I’, (2.11) 


where the jump of the Eshelby tensor 


Ow 
can be computed from the solution of the elastic equation (see below in (2.13), 


(2.14)). 
We assume that the mechanical fields relax at each time t instantaneously 
to equilibrium, which yields 


divo = 0, in RAL, (2.13) 
[o-v] = 0, on T. (2.14) 


Here, ø denotes the stress tensor, which is given by o = W (X Vu). 


2.4 The Mullins-Sekerka evolution as a gradient flow 


We now argue that also the Mullins-Sekerka free boundary problem formally 
fits into the gradient flow framework: M has to be chosen as the manifold of 
all sets, representing the particle phase, with fixed volume, i.e. 
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M := fx Re > (0,1}| f x dx = V , suppx CC R°} . 
Ra 


The tangent space TyM can be described by all admissible normal velocities 


of I’, that is 
TyM = fu:r >R] f van =0} 
T 


The metric tensor is given here by the H7! norm in the bulk. More precisely 
gx(v, 0) = | Viv: View da, 
R4 
where py (respectively ug) is the bounded solution of the elliptic problem 
-Am =0 in RAS, (2.15) 
[Viv v] =v onl. (2.16) 


After an integration by parts we obtain 


oxo) = f -möh 
r 


Our assumption in the previous chapter was that we have a clear separation 
of time scales such that the mechanical fields can be assumed to relax instan- 
taneously to equilibrium given a phase distribution x. Thus, we replace our 


energy by 
= f vx + min f W(x, Vu) dx 
uU Rd 


Indeed, it follows that min, fga W(x, Vu) dx = faa W(x, Vuy) dx, where uy 
solves (2.13), (2.14) for given y. 
We have now all the ingredients for a gradient flow evolution at hand. In 
order to compute it explicitly we have to calculate the differential of €. 
First, we recall the well-known result = the first variation of surface 
area is the mean curvature, that is for E(x) := f |Vx| we have 


(diff€, 6) = ie KodHt! 
r 


for all ò € TyM. The differential of the elastic part of the energy is (compare 
[Ga03a]) 


(difê, 3) = —E[ys5]|5—0 


= f (w (x, Vu)1 — (Vu)? Zvu) v- vð dH! 
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The part in brackets is again the Eshelby tensor E(y,Vu) (2.12), and the 
jump of its normal part across the interface is the contribution of the elastic 
energy to the evolution of the particles. 

Therefore we have for the gradient flow, evaluating (2.7), that 


0 = gy(v, 0) + (diffE, ù) = | (k+v: [E(x, Vuy — wy) vdH* + (2.17) 
r 
for all v € TyM. We see in fact that for the direction of steepest descent the 


corresponding potential satisfies — up to an irrelevant additive constant — the 
Gibbs-Thomson law with elasticity (2.11). 


3 Spinodal decomposition 


At high temperatures the free energy 7 is convex and hence a homogeneous 
state is stable. If now the system is quenched below the critical tempera- 
ture 0e the homogeneous state becomes unstable and different phases form 
which can be distinguished by a different chemical concentration. This pro- 
cess happens on a very short time scale and the regions with different phases 
have sizes which are given by a small length scale. If the elasticity tensor 
or the eigenstrains are anisotropic, one will observe that the phase regions 
orientate themself in certain directions (see [GMW03],[GRW01]) for numeri- 
cal simulations). We will now describe how one can make these observations 
quantitative. We first solve the linearized Cahn-Larché system with the help 
of Fourier transformation (see Khachaturyan [Kha83]). Then a method de- 
veloped by Maier-Paape and Wanner allows to show that one will see certain 
patterns after spinodal decomposition with a probability close to one for the 
nonlinear evolution (we refer to Garcke, Maier-Paape and Weikard [GMW03] 
for details). We will assume here that C does not depend on c which is the 
elastically homogeneous case. 

Linearization of the Cahn-Larché system around a constant stationary 
state (c, u) = (Cm,0), where cm E R is constant, gives 


Oye = (—A)(yAc — Y” (em)e + €* : 8), (3.1) 
7.6 =0, , 
S = C(e(Vu) —e*c). (3.3) 


We consider the system (3.1)—(3.3) on 2 = (0,27) x - - -x (0,27) with periodic 
boundary conditions. For a given c we can compute u from (3.2)-(3.3) by 
Fourier transformation and we can express «* : S as a function in c. The 
result will be denoted as 

L(c) =: S. 
For 


y.(x) =e", i being the imaginary unit, 
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with 
k= (K1,...,a) € ZË 


one obtains (see [Kha83, GMW03)]) 


L( pn) = LK) Px 


with 
L(K) = e* : (C[Z(K)S*kKT] — S*) 


where S* := C[e*] and Z(«) is the inverse of 
d 
ZS SO gmnts tin 
j,m 


An important observation is that L is homogeneous of degree 0 which implies 
that £ is a pseudo-differential operator of order 0. The function L can be 
computed more explicitly in certain cases, e.g. if C is isotropic or has a cubic 
symmetry (see [GMW03]). In the particular case of cubic symmetry one ob- 
tains that certain directions x € Z are stronger amplified by L than others. 
This has important consequences for (3.1)—(3.3). If we consider solutions to 
(3.1)-(3.3) of the separation of variables form 


c(x,t) = f(t, 


we obtain 
fO) =at ,aweR, 


with 
Arg = |K]? (qlk? — Y” (cm) + L(K)). 


If cm is such that Y” (cm) < 0, one obtains in the case without elasticity that all 
k with a certain wave length are amplified the most. Now in case of anisotropic 
elasticity also the direction of xk plays an important role when we want to 
determine the most unstable waves. It turns out (see [Kha83, GMW03] and 
the references therein) that in case of cubic anisotropy either directions parallel 
to the coordinate axes or directions parallel to the diagonals of the coordinate 
axes are amplified more by the influence of elastic interactions. Which of the 
two cases occur depends on the parameter AC := Cy, — Cy2 — 2C44. One 
speaks of positive anisotropy if AC > 0 and of negative anisotropy if AC < 0. 

We will demonstrate this for the case of negative anisotropy. In Fig. 3.1 we 
show the most amplified eigenmodes and a typical function which is a linear 
combination of basis functions with these eigenmodes. In Fig. 3.2 we show a 
typical solution of the Cahn-Larché system after spinodal decomposition. We 
show the modulus of the Fourier coefficients and the sign of the concentration 
difference in the case of cubic negative anisotropy. One clearly sees the cubic 
anisotropy which is in contrast to the isotropic case where patterns do not 
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Fig. 3.1. The most amplified eigenmodes in the (#1, K2)-plane (left) and a typical 
pattern (right) for negative anisotropy (AC < 0) 
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Fig. 3.2. Cubic anisotropy of the elasticity tensor; modulus of the Fourier coefficient 
(left) and sign of the concentration difference c (right) 


follow a direction. In fact typical solutions look like in Fig. 1.1 to the left 
(see [GMW083] for more details). We also refer to [GMW03] for the proof of a 
theorem which roughly speaking says that with a probability close to one, the 
evolution to initial data which are randomly chosen out of a neighborhood of a 
uniform state will be dominated by an invariant manifold which is tangential 
to the most unstable eigenfunctions of the linearized operator. 


4 Numerical approximation of the Cahn-Larché system 


The Cahn-Larché system has a variational structure and hence it is natural 
to use a finite element method for the discretization. The formulation of the 
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Cahn-Larché system (2.4)—(2.6) is of second order in space and we will use 
continuous piecewise affine elements to approximate c, w and u. 

For a polyhedral domain 2 we choose a quasi-uniform family {7"}), 50 
of partitionings of 2 into disjoint simplices with maximal element size h := 
max{diam s}, so that 2 = Usera 5. Associated to T” is the finite element 


space of continuous piecewise affine elements 
= {p € C°(Q) | yg is linear for alls € Pen). 


To formulate a finite element discretization we introduce the lumped mass 
scalar product (-,-)” instead of the L? scalar product (-,-) as follows: For 
U1, V2 E CO) let 

(v1, v2)” = = fea" (viva) 


where 7” : C°(Q) — Sè is the interpolation operator, such that 
(x’n)(p) = n(p) for all nodes of 7”. 

Then a semi-implicit scheme for (2.4)—(2.6) reads as follows. 

We search for c”, w” : [0, T] — S” and u” : [0, T] — (S")4 such that 


(asc, p")"=-(Vw", Ve"), ; 
(w, p” =V", Vo") + (W (e), p)” + (Wele, Vu”), g”), (4.2) 
0=(e(Vu") — z(c”), C(c*)e(VE")) 


holds for all y” € 9”, €" € (S")4 and all t € [0, T]. 

In order to obtain a fully discrete scheme one needs to introduce a time 
discretization. The simplest implicit time discretization is the implicit Euler 
scheme in which the time derivative in (4.1) is discretized in the following way 


h h 
Ca Gy 
oogt (Aa a) 


h 
Tn 


Here we divided the time interval [0, T] into N steps with length 7, and set 
tn = J; Ti. The discrete solution at time tn is denoted by (c}, w}, ut). 
The resulting numerical scheme has been analyzed in [GRW01, GW05). In 
[GRW01] optimal error estimates have been shown in the case that C does 
not depend on the concentration (homogeneous elasticity). In the case of in- 
homogeneous elasticity a convergence proof has been given in [GW05]. 

The fully discrete scheme has the properties that mass is conserved and 
that the total discrete free energy decreases (see [GRW01, GW05]). The last 
observation is a consequence of the fact that the discrete problem reflects the 
gradient flow property of the continuous problem and this is an important 
fact in the analysis of the scheme (see [GW05]). 

It turns out that the so-called O-scheme [BGP87, MU94] leads to a more ef- 
ficient but hard to analyze time discretization. All the computations presented 
in the following are with the help of the O-scheme, but we made sure that com- 
putations with the implicit Euler scheme lead to qualitatively similar results 
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Fig. 4.1. Alignment of two particles (anisotropic inhomogeneous elasticity), adap- 
tive computational grids 


although with higher computational effort. We consider adaptive triangular 
grids in space and a corresponding a posteriori error control [GRW01, GW05]. 
The discrete linear systems were solved with the help of the BICG and GM- 
RES algorithms and for the nonlinear discrete problem we used Newton’s 
method (see [GRW01, GW05] for more details). 

Another approach to solve the Cahn-Larché system numerically uses spec- 
tral methods. We refer e.g. to the work of Dreyer and Müller [DMO00] and Leo, 
Lowengrub and Jou [LLJ98] and the references therein. Due to the nonlinear 
structure of the Cahn-Larché system, approaches based on spectral methods 
loose their efficiency. This is in particular true in the case where the elastic 
constants are different in the two phases (inhomogeneous elasticity). 

To conclude this section we report on some numerical simulations with 
the above robust and efficient numerical method. We have studied various 
qualitative effects of the Cahn-Larché model including homogeneous elasticity. 
We observe e.g. the following (see also [LLJ98]) 


e particles align their faces to the elastically soft directions of the material 
(see Fig. 1.2), 
particles align in rows (see Fig. 4.1), 
always the harder phase forms particles in the softer phase independent of 
the volume fraction (see Fig. 4.2), 

e inthe case of inhomogeneous elasticity one observes that particles do not 
merge when close to each other but instead repel each other (see Fig. 4.3). 


The numerical approach for the Cahn-Larché model turns out to be ef- 
ficient for ensembles ranging from a couple of particles to a few thousand 
particles and has been applied to derive experimental results on growth laws 
(see the following section). 
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Fig. 4.2. Effects of inhomogeneous elasticity: On the left side the green phase is the 
elastically harder one, the blue phase is softer. On the right side it is vice versa. The 
volume fraction of both phases are the same. (See page 690 for a colored version of 
the figure.) 
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Fig. 4.3. Repulsion of two particles due to anisotropic elasticity 





5 Ostwald ripening within the Cahn-Hilliard and 
Cahn-Larché models 


A relevant issue in coarsening systems is an estimate of the coarsening rate of 
the system. The latter can be expressed by the rate of growth of mean particle 
size or by the rate of decrease of surface energy. Dimensional arguments give, 
that the coarsening rate in diffusion controlled coarsening, as described by 
the Cahn-Hilliard equation or the Mullins-Sekerka evolution, is proportional 
to t'/3, Weak time-averaged upper estimates of this coarsening rate have been 
established in [KO02] for the Cahn-Hilliard model without elasticity. It turns 
out that the proof goes through without any difference for the Cahn-Hilliard 
equation with elasticity. Whether or not this estimate is sharp, say, for generic 
data, in the case with elasticity is however not clear. 

Using the adaptive finite element method described in Sect. 4 we made 
an attempt to study coarsening rates for large particle systems. We first con- 
sidered the Cahn-Hilliard model without elasticity. Due to the adaptive grids 
(see Fig. 4.1) and the time discretization based on the O-scheme simulations 
with about 4000 particles after the initial phase of the particle formation 


Phase Separating Systems with Elastic Misfit 167 








0.175 

















1-4 7 





Fig. 5.1. Graph of the energy at an early and a very late stage of the evolution 
(two graphs on the left side), different time steps of the evolution (on the right side). 
(See page 691 for a colored version of the figure.) 


have been feasible. Let us give a brief summary of the results for the original 
Cahn-Hilliard model without elasticity (details can be found in [GN*03}): 


e The observed decay rates for the energy and the growth of the averaged 
particle size are in correspondence with the basic LSW theory. There in 
2D one has a decay of the energy like: 


E=Ct3. 


e Depending on the initial data (arbitrary distributed small particles (cf. 
Küpper, Masbaum [MK94]), a slightly perturbed homogeneous mixture, a 
homogeneous mixture with arbitrary positioned localized seeds for parti- 
cles) we observe a rather long intermediate behaviour with energy decay 
and particle growth rates different from the expectations. 


Figure 5.1 shows results obtained by our extensive numerical tests. On 
the left the energy is plotted in double logarithmic scale over time. In this 
representation the expected polynomial decay should turn out as a straight 
line. This is the case albeit with the unexpected exponent of 4. In fact, we 
made the observation that after spinodal decomposition the system settles for 
a wrong exponent for quite long time. Depending on the volume fractions of 
the two phases the exponents observed range from —} to the expected —} in 
the case where we start with equal volume fractions (see Figure 5.1 upper left 
graph). However, after a long time the speed of the energy decay changes and 
we see a behaviour in line with the expectations. In the example shown the 
energy decay at times t > 7 differs significantly from the behaviour at earlier 
times. We observe a graph like in the left lower part of Fig. 5.1. Here we see 
time phases where the energy goes according to 


Ex Ct? 
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which are intersected by short periods, when the energy decays faster. In 
these short periods one sees particles vanishing, whereas between these steeper 
declines particles are just growing and shrinking with the number of particles 
constant. 


In the case of the Cahn-Larché model with elasticity we have observed 
so far, that the coarsening rates are affected by the presence or absence of 
elasticity as well as by the homogeneity of the elasticity. Anisotropy seems to 
play a minor role (cf. Fig. 5.2). However, it may be the case that the coarsening 
rates change at later times as in the standard Cahn-Hilliard model. 
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Fig. 5.2. Graph of the nonelastic part of the energy. (See page 691 for a colored 
version of the figure.) 


6 Simulation of the sharp interface model 


Different from the diffuse interface model which allows a straightforward dis- 
cretization via finite elements (cf. Sect. 4) the interface propagation in the 
Mullins-Sekerka sharp interface model (2.9)—(2.14) and the computation of 
the corresponding energy contributions and their variations has preferably 
been implemented using the boundary element approach. Hence, the linear 
elliptic subproblems for the chemical potential (2.10),(2.11) and the elastic 
displacement (2.13),(2.14) are transformed into integral equations on the in- 
terface between the different phases. They are then dicretized based on a 
collocation-type ansatz. Thus, for d = 2 the interfaces are resolved by polygo- 
nal lines whose vertex positions are updated in the actual evolution. The two 
dimensional, evolving phase domains have not to be meshed and adapted in 
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each time step of the evolution. Indeed the interface geometry enters the for- 
mulation via appropriate Poisson type kernel functions and kernel functions 
for linear, anisotropic elasticity to be integrated in the collocation ansatz on 
the polygonal lines [Ha95]. In each time step only the vertex positions rep- 
resenting the interface have to be updated. This discretization approach has 
among others already been successfully applied by Voorhees, Lowengrub and 
coworkers [ATV01, JLL97, TAV04a, TAV04b, VMJ92). 

Let us first depict this transformation for the chemical potential subprob- 
lem. Let Yr, (£) := — + ln |x — zo| be the fundamental solution for the Lapla- 
cian in R?, ie. Av),,(a) = (x — xo) in the sense of distributions. Applying 
Greens formula we obtain for points zo on a smooth interface I’: 


po) = f {](©)2, 400) ~ Veola) [3v] (2)} de 
+ 1 { u(x) Osx (a) — Yro l£)ðsulr)} dz. 
OBR 


Here, u is the chemical potential, [-] the usual jump operator and Br is a 
large ball containing all particles. Recalling that u is continuous across the 
interface (2.11) the jump of the chemical potential [u] vanishes. Furthermore, 
for R — co the integral over Bpr converges to a constant c(t) solely depending 
on time. This additional degree of freedom reflects the conservation of the 
overall particle volume. Finally taking into account the governing equation 
for the normal velocity of the interface (2.9) the Mullins-Sekerka problem can 
be rewritten in the following form: 


Let I(t) be the interface with normal velocity v, « its curvature, and 
E = E(x, Vu) the Eshelby tensor (2.12), then at time t 


(x0, t) + [E(xo, t)]u(@o, t) : v(wo, t) + c(t) = | Pro (x)v(x, t) dH" (6.1) 
r(e) 
for every zo on I(t) and the velocity field fulfills the constraint 0 = fp aY dH. 


The solution of the quasi stationary elastic subproblem is required for the 
evaluation of the Eshelby tensor E on I’. Let C'a be the elasticity tensor, where 
the index a indicates either the matrix or the particle phase, and denote by 
€q the misfit. Let u be the displacement on the interface and 7 the normal 
stress defined as the difference between actual strain and eigenstrain in normal 
direction: T = ov = C(e(Vu)—€q)v. We recall from (2.14) that 7 is continuous 
across the interface. Now we consider the matrix valued fundamental solutions 
w%, for linear, anisotropic elasticity from [CR78, Cle87] and obtain again by 
Greens formula an integral equation 


lve) = | doar aule) ~ V) (re) + Cačav) aH. 
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Let us remark that in case xo coincides with a vertex on a polygonal interface, 
a matrix c(xo) depending on the direction of the two edges at zo is applied 
to the displacement u(xo) on the left hand side replacing the factor $. Given 
the above integral equation for the matrix and for the particle phase, the 
displacement u and the normal stress T are up to a constant displacement 
uniquely determined. The computation of the Eshelby tensor requires the 
evaluation of the full displacement gradient Vu. For given u and 7 this gradient 
can be computed differentiating the above integral equation with respect to xo. 
This differentiation applies to the integral kernels, thus increasing the order 
of singularity. In particular, for the kernel 0o,,W%, a hypersingular integral 
has to be evaluated. 

In the actual spatial discretization the integral equations are assumed to 
be fulfilled at appropriate collocation points on a polygonal interface and the 
displacement, the normal stresses, the chemical potential, and the interface 
velocity are approximated in a corresponding discrete space. Two particular 
useful choices are either piecewise constant ansatz functions on the polygon 
segments and segment centers as collocation points, or a piecewise linear func- 
tions and vertices as collocation points. For the elastic subproblem, piecewise 
linear ansatz functions are the appropriate choice, since piecewise constant 
ansatz functions do not make sense with respect to the above sketched eval- 
uation of the deformation gradient on the interface. 

For the notion of a discrete curvature on the polygonal interface we refer 
to [Dz91] and define on vertex x; a curvature vector 


Tiji Ti ia 
leii: [leer ae—all (6.2) 


lei+iızzilltllei rill ? 
2 





Ky, := — 


where v; represents a unit length vector and «x; the discrete, scalar curvature 
required for a spatially discrete Mullins-Sekerka model. Finally, a suitable 
time discretization for (6.1) has to be considered. An explicit treatment of the 
discrete curvatures xk; would result in severe time step restrictions. Thus, we 
evaluate the normal direction — according to the above equation — at the old 
time step and redefine a semi-implicit scalar curvature as the scalar product 
of this time explicit normal field with a semi-implicit curvature vector. For 
the latter, we again follow [Dz91] and consider time implicit vertex positions 
but a time explicit edge length in the above formula (6.2). 

Let us depict two types of particle interaction in the presence of inhomo- 
geneous and anisotropic elasticity. Figure 6.1 shows the attraction of particles 
in case of a strongly inhomogeneous elasticity with a hard particle phase and 
a considerably softer matrix phase. Figure 6.2 renders the alignment of parti- 
cles, which can be observed in the presence of strongly anisotropic elasticity. 
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Fig. 6.1. Three time steps of a discrete Mullins-Sekerka evolution showing the 


attraction of two soft particles in case of isotropic but inhomogeneous elasticity. 
The matrix phase is four times harder than the particle phase 
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Fig. 6.2. Three time steps of a discrete Mullins-Sekerka evolution with particles 
lining up 


7 Reduced sharp interface model for larger systems 


In order to make simulations for large particle systems feasible, we now set 
up a reduced model of the Mullins-Sekerka evolution with elasticity. The re- 
duction is based on the observation that in the case of a cubic anisotropy 
in the elasticity, particles become quickly rectangular, whereas the long-time 
behavior is dominated by long-range interactions. This motivates to reduce 
the gradient flow of the Mullins-Sekerka evolution to the submanifold of rect- 
angular particles. We will see, that such a reduction is in very good agreement 
with the full evolution for a small set of particles. We will then also present 
first results for larger particle ensembles. 

We restrict our dynamical system to the submanifold M C M which con- 
sists of sets which are the union of disjoint rectangular particles aligned with 
the coordinate axes. 

To define M we first need to introduce some notation. As indicated in 
Fig. 7.1 each particle will be identified by the two points 


p= (P7, p*) = (pr, py), (vt py) E€ R? x R?. 
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ut pt = (pe, py) 
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p= (Pz, Py) |; 


Fig. 7.1. Configuration of one rectangular, axis-aligned particle 


We denote the edges perpendicular to the z-axes by bz and bł, the ones 
perpendicular to the y-axes by b; and oF, more precisely b> := {p7} x 


[py pil Of = {pt} x py pe] and by = [pr ” pi] x{p, hot := [pz pt] x {pt 
Consequently the volume of a eae p is given by ap = |b; | - |b, | and the 
boundary length by lp := |b; | + |by | + [bE | + [of]. 
The normal velocities of the sides are given by 
Up = (vz vg, v, ur) E R*. 


xUx Vy Vy 


Now our submanifold M can be identified with the space 


Vena 


where N is the number of particles, and the tangent space with the hyperplane 


L; (be (wy + (wf) 
oN] Hoz (wz + (w)? yeah 


N := {P= frist tn 





TpN := fv =A ha E 
The surface energy € can be expressed as 
E= J Ixi = 2D (WOds1 +0950. 
such that the variation of € with respect to õ € T,N is given by 
(diff€ č) = 2). (i)z + EDE + (vi), + NE). (7.1) 


We are now going to consider what interfacial condition is satisfied for the 
direction of steepest descent as given in (2.17). For that notice that we can 
for any vector w € R*” construct an element 6 € T,N by setting 
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7 De (wibse + (wybs)z + (wibi); + (wybs)y _ 
Vi = Wi — ——————— UE W L W. 


2 lv; 


Using (2.17) and (7.1) we easily find that there is a constant C such that for 
alli € 1,..., N we have 


2 
udH! = +f E, s]dH!+C 7.2 
Ta |(bi)8| woe! woe! ue) 


a 





where a = x,y, 8 = +,-, i =1,...,N, [By] := [E(x, Vu)] : vovg and 
f= T Je where E(x, Vu) is the Eshelby tensor from (2.12). The term 


oe can be interpreted as a crystalline curvature which also appears for 


iJa 


surface energies with cubic crystalline anisotropy (see e.g. Taylor [Tay78] and 
Gurtin [Gu93]). 

We summarize the above to define the evolution in the restricted setting. 
This will also motivate the order in which the equations are evaluated in the 
numerical algorithm. 

For a given particle configuration P = {p; : i = 1,..., N} we compute in 
view of (2.13), (2.14) the elastic deformation u from 


dive (x, Vu) =0 in RA\T, 
OW 
== = = .)8 
[u] = ER Vu): r| =0 on I := l i (b;)P 


for a = x,y, B = +,—, i= 1,..., N. The chemical potential u is given (cf. 
(2.15) and (7.2)) by 


Au =0 in R°V, 


2 
udH* = +f [E n e] dH! +C 
ie I(l ae 





for a = x,y, B = +,— and i = 1,..., N, so that the velocities can be derived 

from (2.16) via 
(ie = -lð oH for a = x,y, b= +,—, i=1,..., N. 

This evolution is well-defined until the side of a particle shrinks to zero. Then 

we remove this particle and continue with the remaining particles. 

To compare the reduced model to the full Mullins-Sekerka evolution, we 
simulate the interaction of a group of particles. Indeed, from a start config- 
uration for the full model, we compute a couple of small time steps to allow 
the particle shapes to relax to their preferred form (which happens rather 
quickly). For this configuration of nearly rectangular particles we construct a 
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Fig. 7.2. Evolution of the interface for the full Mullins-Sekerka model (top) and 
for the reduced model (bottom). In both cases the times t = 0; 0.0011; 0.004 are 
depicted. 


matching starting configuration for the reduced model. This configuration is 
then considered as the initial data both for the reduced and the full model. In 
both models particles below a certain small diameter are deleted completely. 
Figure 7.2 shows computational results for both models at different time steps 
of the evolution. The corresponding evolution of the interfacial energy over 
time is depicted in Fig. 7.3. Furthermore, plots of the elastic strain, stress and 
the energy density are compared in Fig. 7.4. Indeed, one observes a striking 
qualitative similarity and basically the same temporal behaviour. 





Mullins SekSrka = 
Redt cea N 1odel —— 














Fig. 7.3. The evolution of the interfacial energy is rendered for both models. Time 
and energy axis are logarithmic 
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Fig. 7.4. The trace of the elastic strain (both top rows, compared to zero strain 
and eigenstrain), the trace of the stress (middle) and energy density (bottom) for 
the Mullins-Sekerka (left) and the reduced model (right), in the initial configuration. 
(See page 692 for a colored version of the figure.) 
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Finally, time steps from the evolution of a moderately large particle ensem- 
ble with about one thousand particles and homogeneous anisotropic elasticity 
are shown in Fig. 7.5. 





Fig. 7.5. Coarsening of a moderately large particle ensemble 
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Summary. We derive a general theory for elastic phase transitions in solids subject 
to diffusion under possibly large deformations. After stating the physical model, we 
derive an existence result for measure-valued solutions that relies on a new approx- 
imation result for cylinder functions in infinite settings. 


1 Introduction 


The general idea of the present work is to derive a satisfying mathematical 
theory of phase transitions in single-crystals where the elastic properties of the 
material are described by nonlinear laws, see [Ogd97], [H0100]. This generalises 
common existing models with linear stress-strain laws, starting with pioneer- 
ing works by Khattchaturyan, [Kha83], and makes the model applicable to a 
somewhat broader class of materials. 

Beside prescribing the elastic behaviour of the material, the other funda- 
mental assumption is that all occuring phase transitions are reconstitutive, 
i.e. that there are no plastic deformations of the crystal. 

For the proof of existence of solutions to our new model, the experiences 
with the Stefan problem, [Vis96] and [LS95], were found to be very valuable. 
But as it turns out, due to the nonlinear aspects of elasticity, a formulation 
within the framework of Sobolev functions is not general enough. This leads in 
a natural way to the formulation of the solution as Young measures presented 
in Sect. 4.1. 

Historically, there is a variety of approaches to phase transitions in solids 
and it is not possible to give here a comprehensive overview. Instead, we refer 
the interested reader to the monographs and survey articles [Vis96, WS*93, 
Miil98, BS96, SR86] and [GHM01]. After these classifying comments we give 
now a general survey of this article. 
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We consider the elastic theory of single crystals at constant temperature 
where the free energy density depends on the local concentration of one or 
more species of particles in such a way that for a given local concentration 
vector certain lattice geometries (phases) are preferred. 

The local concentration of the molecules may change due to diffusion. The 
time scales typical of diffusion and of elastic deformation are usually signif- 
icantly different and in good approximation it is admissible to assume that 
the deformation adjusts infinitely fast to the local situation. In the developed 
model there is surface energy contributing to the free energy of the crystal and 
the model allows for m different coexisting macroscopic phases. We will as- 
sume that the crystal does not possess interstitials and that the time-evolution 
of the boundary of the domain is known. 

After deriving the physical model with the above properties we will discuss 
the existence of solutions to this model by means of an implicit time discreti- 
sation and will show that in the limit of vanishing time step the time-discrete 
solutions converge in the sense of Young measures on suitable Banach spaces 
with separable dual. To achieve this goal, we will derive a general approxima- 
tion result for cylinder functions in the infinite setting. Finally we can proof 
an energy inequality for the limit solution. 


2 Model and implicit time discretisation 


In this section we present our model. In the first subsection the mathematical 
equations are derived whereas the second subsection presents a reasonable 
solution strategy. This section is only heuristic and shall make the reader 
familiar with the general ideas. A deep mathematical treatment is done later. 


2.1 Derivation of the model 


To describe the physical phenomenon presented in the introduction, we make 
use of non-equilibrium thermodynamics, see [dGM84], [KP98], and of contin- 
uum mechanics, see [Gur81], [Cia88]. We neglect the atomistic structure of 
the crystal and disregard possible effects of the microstructure. The model is 
based upon the following basic considerations: The diffusion is caused by the 
gradients of the chemical potentials. The diffusive flux causes a local change of 
the free energy of the crystal. The free energy shall depend on particle density, 
elasticity of the crystal and phase parameter only with a term representing 
the surface energy of the boundary layers. 

In good approximation we can assume that the system is in mechanical 
equilibrium. For the analytical treatment, we will assume in this work that 
deformation and phase parameters are global minimisers of the free energy 
with respect to the present particle densities. 

At starting time t = 0 the crystal is described by a non-empty, bounded 
Lipschitz domain 2 C RÌ. Let R, := [0,00[. The evolution of 2 is given by a 
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family of C?-diffeomorphisms {% : R? — R? : t € Ry} with 

W(x) = z for all z € QQ, (2.1) 
(R+ x R? 5 (t,£) = G(x) € R?) € C? (R4 x R?, R3), 

(Ry x R? 5 (t,£) = Wr (x) € R?) € C?(R, x RÌ, RÌ). 












































The domain occupied by the crystal at time t > 0 is denoted by R; := %(2). 
The mechanical deformation is given by a family of mappings {®, : R4 x 
N — h : t E Ry} that satisfy for all t € R+ and for all x € 2 





Po(z) = x, (2.4) 
B, € Wt*9(Q,R) and 6,1 € Wt?+°(Q,, R?) exists, (2.5) 
det V®, > 0 a.e. in R. 


Here, 6 > 0 can be arbitrary. The conditions (2.5) and (2.6) ensure that 
@, are deformations. The condition 6 > 0 guarantees the integrability of the 
functional determinant and therefore that the volume is finite. Condition (2.4) 
reflects the fact that the initial state is undeformed. 

The space- and time-dependent particle densities of the n € N different 
species of molecules are described by p; : Ry x R? — R, i = 1,...,n. The 
following natural conditions are postulated for t € R4 and 1 <i<n: 


pi(t) € L*(Q), 
pi(t)>0 ae. in 2, 


[vit x) dx = Jo (x) dx > 0, (2.9) 
2 2 

X pilt) oP det VO <1 ae. in 2. (2.10) 
w=1 


The functions po, are given initial values with 


po; ELA(2), 0< i (x) dz, O<po, ae. in 2, (2.11) 
Q 


Spi <1 a.e. in Q, [Lame dx < |2]. (2.12) 
i=l Q i=1 


Equation (2.9) ensures the conservation of mass, (2.10) is due to the fact 
that the crystal does not possess interstitials and that the number of lattice 
positions in a volume element is a uniform constant. We assume (2.12). with 
strict inequality as we assume a non-vanishing vacancy density. 

Let m € N denote the number of different possible phases. The phases are 
described by a family of phase vectors {xi = (Xa) ja :2,-R:teE R4}, 
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where the initial value xo is given. Here, xj, (x) determines whether the mate- 
rial point x at time t € R+ is in phase j, 1 < j < m, ie. Xj, are characteristic 
functions. As any point in the crystal belongs to exactly one phase, the func- 
tions yj, fulfil for any 1 < j < m and t € Ry 


Xj (l ‘x Vid = 0, (2.13) 
Yxa = 1 (2.14) 
j=l 

Xj. E BV( 2 ). (2.15) 


With (2.14) we may write the surface energy between phase 7 and j at time t 
as 


Sz: LH) x L) > Ry = R4 U {+00}, 


2s (|Vpi| + |Vp2| — |V (p1 + p2)|), if J Veil < œ%,i = = 16) 
P1,P2) > t t $ 
co, otherwise. 





Assuming further that the densities of the surface energy Sij on the interface 
between phase 7 and phase j are positive constants with Sij = Sji, the surface 
energy Ff (xz) at time t € Ry can be introduced by 


FEL? (%,R™) Ry, p= (prea >) >) oujSt(pi,p5) (2-17) 





i=l j=l 

with oj; := $, 1 < i,j < m. Additionally we postulate for 1 < i,j,k < m 
Tij = Oji, (2.18) 
(i # jand k ¢ {i, j}) > a, < Oik + Okj- (2.19) 


So, energetically it is not favourable to add a third phase between two other 
existing phases. 

We assume that the volume density of phase j depends on the particle 
concentrations and the gradients of the deformations. It is given by a mea- 
surable function (see p.9 in [Bau92]) fj : R” x M? > R}, MF := M(k x k, R), 
KEN. 

Let for t € Ry 


Bi := L! (h, R”) x Wh"? (0,,R°) x L* (h, R”), (2.20) 
Def; := {(r,d, p) € By: detVd £0 ae. in 2}, (2.21) 
F? : Def; > R:= RU {—co, +00}, 


(r,d, p = (pep) > > farsi (r(x), (Vd)~*(2)) dax.(2.22) 


J=19), 
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By (2.22) the volumetric free energy FP (p:,®;', X+) of the crystal at time 
t € R+ is introduced. For the total free energy of the system at time t € Ry 
we write F;(p(t),®;*, y+) with 


F, : Def; > R, (r,d,p) = F?(r,d,p) + F£ (p). (2.23) 


For the subsequent formal derivation we assume that all functions are suff- 
ciently smooth. We extend the density vector p(t) by 0 to a function on the 
whole of R”. We use the notation Ov := (3v), where ð is an arbitrary 
differential operator. 

The evolution in time of the particle densities is described by the continuity 
equation 


plt) = -div F; in Q, (2.24) 


where F; := (F;,);_,, and F;, is the particle flux of species i, 1 < i < n, at 
time t € R,. In our case, Fi, consists of two components, the diffusive flux 
Ji, and the mechanical flux M;,. 

We write 


Fi = -Ji + Mi, (4 = (Se) M: = (Ma) ) ý tE R4. (2.25) 
For the mechanical flux we easily find 
Mi, = pi(t)O:®; 0 P7 in h, tER},1<i<n. (2.26) 


We introduce the notations p(t)3®, o B7" := (p(t) o Sr) and write 
(2.24) as 


Op(t) = div (Ji — p(t) 0 P7+) in Q, t E€ Ry, (2.27) 
or equivalently 
Oi (p(t) O Bidet VB) = div J; O &,.det VP; in Q, tE R+. (2.28) 


At fixed constant temperature the diffusive fluxes are caused by the neg- 
ative gradients of the chemical potentials which are the thermodynamic 
forces, [KK93], [dGM84]. According to Onsager’s postulate, [KY87], [Ons31a], 
[Ons31b], [AGM84], every thermodynamic flux is a linear combination of the 
thermodynamic forces. So we set 

















Ji, = XO LinVir, in h, 1<i<n, teR’, (2.29) 
k=1 
or Je = LV pe in Ri, (pt = (Mee) per), t © Re (2.30) 


with a symmetric and positive definite matrix L := (Lir); x-1 and the chem- 
ical potential u; of species 7, where the symmetry and positive definiteness of 
L comes from Onsager’s reciprocity relation, [Ons31a], [Ons31b], [4GM84]. 
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According to the definition of the chemical potential one has in 9, for any 
te R4 andi=1,...,n 


=E abd ), VP of, ) =X xa One Fi(0(t), (VF *)~*). (2.31) 
j=l 


Now we formulate the aforementioned minimality condition on the free energy. 
Considering the time-evolution of the deformation of a representative volume 
element and keeping in mind that the number of particles only changes due 
to diffusion, we find for two possible deformations 1, 4? the relation 


p (t) o Bidet VG} = p?(t) o det VG? in Q, (2.32) 


where p1 (t), p° (t) are the densities corresponding to P1, 82. So the minimality 
condition for any t € R+} reads 


F; (p(t), ®;*, xe) = _ min F, (a(t) o 87! det VE! 5, x), (2.33) 





eD, KEP: 
with 
Dr {sew : & =N,, 3P- ews, detY® > 0 a.e. in o} , (2.34) 
P, := 4 xEBV: Dxiatau(l —y)= 0j=1,...,m}$, (2.35) 
p(t) := p(t) TEA (2.36) 


for t € Ry and the setting BV := BV(Q;,R™), W := W13+9(Q,R°), WÈ := 
WE3+3 (Q, R3). 

To conclude, our model consists of the equations (2.4)—(2.6), (2.7)-(2.10), 
(2.13)-(2.15), (2.27) or (2.28), (2.29) or (2.30), (2.31) and (2.33). 


2.2 Solution strategy - implicit time discretisation 


The objective is to solve the model equations. To this end we discretise the 
equations implicitly in time. The ansatz is the same as in [Vis96], [Luc94], 
[LS95]. 
The following argument is only heuristic. We exploit the minimality con- 
dition on the free energy (2.33) and choose a suitable approximation of (2.28). 
If we formally consider the time derivative of F = F(p,®,x), we find 
(omitting the dependence on t) 


dF = Fop + ôs Fo, + OV FOX. 


From (2.33) it follows 0,F0;x = 0, 0,F0sp + pF = 0. Consequently, 
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dF = 0,F (+p — 06 piP). (2.37) 
Now we compute 06p0;® from (2.32) to obtain 
06 p0,8 = —pTr(VP~'VA, o P7!) — (Vp, Po 1) (2.38) 
= —div(p0,2 o P7!) = —divM, (2.39) 
where Tr(A) := > Akk is the trace of A € M°. If we plug (2.27) and (2.39) 
into (2.37) we iad! 
d,F = 0,F (div J — divM + divM) = 0,FdivJ. 


Assuming that the surface terms do not depend on p, it follows, see [BB92] 
p.58, 


0-2 Yue t)On, fj (Pl£), VPr o GF *(a)) div Ji, (x) dx 


II 
il iM | i 


D fme x)div J; (x)dz, teR4. (2.40) 
lo, 


Assuming further that the normal component of J; + vanishes on 02;, which 
follows from (2.28) and (2.9), we get with the inner product (a, b) := 5 akbk 
in R? i E = 

So ui divd;, = X. div(ui. Ja) — (Vli, Ja), t E€ Ry. 


With the divergence theorem and (2.30) we find 


























D. yt Vi (2), Jy (2)) dx = —2Qu( Je) = —2Q3 (V pu) (2.41) 
= Q0) — Qi (Vm), (2.42) 
where for t € Ry 
Qi: (LQ R)" >R, G5 [(EG,G)de, (243) 
2% 
Q% : (L7(,R3))" >R, Gu J LG) dx. (2.44) 
2% 


Here we introduced the symbol 
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n 


(-,-): (R?)” >R, (a,b) S (ar, br). (2.45) 


k=1 


In (2.41) and (2.42), Qf denotes the Fenchel conjugate to Q+. We call (2.42) 
the Q — Q*-formulation of the problem. In general, every system of equations 
originating from non-equilibrium thermodynamics can be written in the form 


dF+Q4+Q*+G+G* <0, 


where Q and G are certain convex functionals and Q*,G* are their convex 
conjugates. Therefore, Eq. (2.41) can be written in the form 


[ (vette). aivi(a)) dx + BIQ IAJ) = 0, tE R+, (2.46) 
2 


with (V, f, div) = 3 Y Xô fi (pi, (var) div J. 
i=1j=1 
Now we approximate (2.28) for given discrete step size h > 0 by 


p(t +h) = p(t) o 87t, det VG_! 


t+h t+h + hdivJi+n.- (2.47) 


We see that for known (p(t), Bt, X+), a minimiser (®, x, J) of the functional 
(Š, x, J) > Bin, X, J) = Fin (A0) oF Met VS, 8, z) + HQ) 
(2.48) 


satisfies (2.46). Additionally, due to p := A(t) o ®-'detV@-! + hdivJ, every 
minimiser fulfils the equation 


(Rage DOD J (Vpf(p, VB, x), dive) hI 


Nith 
=-h i (Vu, 6S) da +h / (L71J, 8J) dx 
Nith Nith 


and it holds J = LV u and (2.33) is fulfilled. 

Motivated by these considerations we arrive at the following implicit time 
discrete version of our original problem: 

Let p(t), Pt, X+ be the solutions of the problem at time t. Then p(t +h), 
Pr, and Xt+n are given by p(t +h) := A(t)o-'det VO"! + hdivJ, B; := ®, 
Xt ‘= X, where (®, x, J) is a minimiser of (2.48). 


3 The time-discrete system 


In this paragraph we show the existence of minimisers of (2.48) in a suitable 
function space. To this end it is necessary to make (2.48) precise. We make 
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further assumptions on the structure of f1,..., fm, which are motivated by 
the direct method in the calculus of variations, see [BB92], [Dac89] and (2.6), 
(2.10). 

First we want to extend Def; on B; (see (2.20), (2.21) ) and want to ensure 
that the domain where F; is finite is closed. The difficulty here is that in the 
definition of F; the inverse of the gradients of the deformations occur. The 
following ansatz solves this problem, taking (2.6) and (2.10) into account. 

For A € M? let 


Ag2A33 — Ao3A32 A13A32 — Á12Á33 A12A23 — A22 A23 
cofA := | Ag3A31 — Ag1A33  A11A33 — A13A31 A13A21 — A11 A23 
Ag, A32 — A2232 Ai2A31 — A1132 A11A22 — A12Aa1 


With this definition we have A cof A= (cof A) A= (det A) E3 := (det A) (0x1) 2 1—1- 
It is important to notice that for invertible A € M? it holds =+;cofA = A}. 
Therefore we make the following conditions on fi,..., fm: 
There exist gj : R?” x M? x M? xR — Rx such that for all (r, A) € R” x M? 
fi(r, A+) = g;(r, A, cof A, det A), if detA 4 0. (3.1) 





Furthermore we demand that for all (r, A, B, d) € R” x M? x M? x R 
gj(r, A, B,d) E€ R4 iff (r, A,B,d) EZ, 1<j<m, (3.2) 
with the admissible set 
Z = fea Baer" xm :d>0, ord <landr> o} . (3.3) 
i=1 


Here, the condition r > 0 for r € R” has to be understood componentwise. 
As given in (3.1), g; denotes the argument of Fẹ? in the position of fj. Now 
we define for t € Ry with the abbreviation d := (Vd, cof Vd, det Vd), d € W%, 


F? : Bi > R, (r,d, p) => 
x pj()g5(r(2),d(e))de, if stip f Ip; (x)| g(r(2), d(2)) dz < oo, 


+oo, otherwise, 


F;:Bı—>R, (r,d,p) > Ff (p) + F? (r,d,p). (3.5) 


(3.4) 


Since Qs is defined on (L?(92,R?))”, it remains to define the divergence on 
L?(%,R?). This is done in a way adapted to the equations such that the 
conservation of mass (2.9) and the implication (2.40)=(2.41) holds. 


div, : L?(Q,,R?) = WEO), 


div.j(€) := — [uve dz, €€W?(2,R"). (3.6) 
Ê 
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In the following we write div instead of div;. 


Remark. As is shown in [Anr06], for a lower semicontinuous convex function 
g : R” — R that satisfies Cyg(a) + C2 > ||x||1 for all x € R” and positive 
constants Cı, C2 and q > 1, one can define fe g(divJ) in a natural way by 
setting 

J slaivs) = +00, if divJ ¢ L7(2,R”). 

Q 

Now we are in the position to formulate (2.48) appropriately. Let t € R4, 

h > 0, p(t) and ®; be given such that (2.4)—(2.6) and (2.7)—(2.10) are satisfied. 
Then introduce 


Fish : B: x (L?(Q;,R*))” > R, 
(d, p, G) > Fi4n(A(t) o ddetVd + hdivG, d, p) +hQ:(G). (3.7) 


We call the variational problem Etn — min the time-discrete system. The 
existence of minimisers is ensured by the following theorem. 


Theorem 3.1. Lett € Ry and h > 0. In addition to the earlier assumptions, 


let fi,...,fm be lower semicontinuous and convex, satisfying 
ci fj(r, A, B,d) + & > ||AIP?, (3.8) 
j ; 5, Bie? 
c3f;(r, A, B, d) +c} = Tae’ (3.9) 
cd f;(r, A, B,d) + cå > |djt+3 (3.10) 











for all (r, A, B,d) € R” x M? x M? x R with non-negative constants ct, k = 
1,...,6. Then Ein possesses a minimiser (®,x, J) that fulfils (2.5), (2.6), 
(2.7)-(2.10) and (2.13)-(2.15) with p := p(t) o "det VO! + hdivJ. 


The proof of this theorem can be found in [Anr06]. 
Idea of proof: We use the direct method in the calculus of variations. First 
we show the lower semicontinuity of the functionals in a suitable topology 
(weak topology for J and ®, strong topology for x; a new proof for the 
lower semicontinuity of Fë with methods from elementary convex algebra is 
given in [Anr04]). For a minimising sequence (Pk, Xk, Jk) ren, Eq. (3.8) yields 
the boundedness of l8 llw, (3.9) gives the boundedness of ||®z||yys, (3.10) 
gives the boundedness of || det Voz, lla +4 which implies the boundedness of 
[div Jell g 

The norm ||xk||Bv can be estimated by Ff, see [Anr04], which guarantees 
the compactness in L!, whereas the L?-norm of J; is estimated by Qz. 

Exploiting the differentiability properties of functions in WÊ, see [GMS98], 
and properties of the weak convergence in these spaces, see [Cia88], the exis- 
tence of minimisers with the stated properties can be proved. 
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Corollary 3.2. If for 1 <i<n,1<j<m the derivatives 0,,f; exist in Z 
and if lim [ðr fj (Tk, Ar, Br, dk)| = +00 for every sequence (rk, An, Br, dk) ken 
—> CO. 


with jim (rk, Ar, Br, dk) = (r, A, B, d) € OZ, then for a minimiser of Thm. 3.1 





it holds J = Vu with pmi = > Xjôr fil, ®), L<i<n. 
j=l 


The complete proof of this statement is given in [Anr06]. 
Idea of proof: Since the formal method presented at the end of Sect. 2.2 
cannot be applied, we approximate fj by suitable smooth and convex functions 
f} from below and solve the corresponding variational problem Ek, n `~ min 
for which we have JE = Vu", pë = V,f*(p*, ®*) € Wt? (Q)). 

Using convexity arguments and elementary measure-theoretic results one 
can then show with the Poincaré inequality the existence of a subsequence 
with J"! —> J, Vuk — Vu and Vrf (p, b*) = V,f(p, B®) as l — œ. 


4 The continuous system 


According to Thm. 3.1 we can construct for given h > 0 a sequence of time- 
discrete solutions. We discuss the limit h — 0 and show that the discrete 
solutions converge in a sense that has yet to be specified to a solution of 
the original problem. As we have at most weak convergence, due to the non- 
linearity of F, we cannot expect that the weak limit satisfies (2.33). Addi- 
tionally, the equations do not provide a condition on &; and xz in time. If we 
consider the problem on a fixed given time interval [0, T] for T > 0 and regard 
p, ®, x, J, u as mappings from [0, T] to a certain topological space X, we notice 
the analogy to Young measures that yield solutions to our problem in case X 
is finite-dimensional (or locally compact), see [Eva91], [Mül04], [Ped99]. 


4.1 Formulation of the problem with Young measures 


As the domain 2; is time dependent, so is the function space X; containing 
p(t), Bt, Xt, Jt, ut. The space X := Utepo,7] X; has no ‘nice’ topological prop- 
erties. Therefore we consider the quantities A(t) := p(t) o BdetV 8, Ês := 
vy, o Bi, Xt = Xto Vr, Jt := Jio Wr, fy = ueo W% on the reference domain 2 to 
formulate the equations. So we transform with ®; respectively % for t € Ry. 
The corresponding time-discrete solutions exist according to Thm. 3.1 since 
B, possesses the transformation property, see [GMS98]. In the following, â al- 
ways denotes the transform of a. In analogy to the common weak formulation, 
the measure-valued formulation reads: 

Let X := L?(Q,R") x W? x L1+3(Q,R™) x (L?(2, R?))” x W1?(2,R") 
be equipped with the product topology of the weak topology in coordinates 
2, 4, 5 and the strong topology in coordinates 1 and 3. We look for a mapping 
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P : [0, T] > R(Ñ), t> Pı, where R(X) is the space of signed Radon measures 
over X with P; > 0 and P;(X) = 1 for almost all t € [0,7] such that (with 


& := (Ô, $, R, J, fi) 
T 


foo f [oreaavare(e) at 
0 x 2 


T: 
= J O(t) J J J ly) V (Eo È! o W7 t) o%(y)det VY (y) dy dPi(ê) dt, (4.1) 
0 % 2 
T 
fow f f EDE Fnacevuity) dy aP(a) at 
0 x 2 


T 
= J v(t) / / jily)div(é 0 W-1) 0 V(y)det VY; (y) dy dPi(@) dt, (4.2) 
0 rane) 


Oy 
S 
Yah. 
x 
o 
> 
A 
< 
x 
MM, 
arne 
=> 
a 
oO 
ot 
< 
S 
eer: 
= 
a 
«e 
a 
x 
een: 
& 
Q 
~ 


X 
T 

= foo | [Xx odwa.F (E) kw) dyar(aar (43) 
0 % 2 


for all 9 € C%([0,7]), € € CY (2, R”) and 


supp P; C {0 J, i) EX: p € supp P| g,» 


P(o) = min FSO} (44) 
(B, X)EX2X X3 


for almost all t € [0, T]. Here, X, denotes the l-th component of x ,1<l1<5. 
Furthermore, let the conditions analogous to (2.4)—(2.6), (2.7)—(2.10), (2.13)- 
(2.15) be fulfilled for almost all t € [0,7] on supp P+. 


4.2 Construction of the Young measures 


The key to the proof of existence of measure-valued solutions to the continuous 
problem is given by the following theorem. 


Theorem 4.1. (Analogon to Young measures in the infinite setting) 
Let Ir := [0,T] C R for T > 0 and àr be the Lebesgue measure on Ir, (Vi)ien 
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be a sequence of positive Radon measures on Ir with ni > Ar for i— œ, let 
Xı, Xo be Banach spaces with separable Xï, XZ, define 


(X, | i lx) = (Xı x Xo, | i xa T | ; ll); (X, T) = (Xı x Xo, | f lx x wx), 


and (yi : [0,T] > X)ien be a mapping. 
Tf K? CC (Xi, | i lx); Kö CC (X2, wx.) and 
1 
vi (M? := {t € Ip: yilt) € KT x KZ}) < -— forieN, (4.5) 


n 


then there exists a subsequence (Yi, )ken and a mapping P : Ir > R(X) with 


P>0, Pi(X)=1 for almost all t € Ir (4.6) 
and 
(0,T]> t> frea dP: (2) € R) €L*((0,T)), (4.7) 
T 
jim fr , Vin (t)) dvi, (t =f jn t, £) dPilx (4.8) 
0 


for all f € Cy([0,T] x X). 


Corollary 4.2. Additional to the assumptions of Thm. 4.1 let there exist a 
q > 0 such that for all i € N 


Iall € L'r) and f oig dat) < F. (4.9) 


Let f : (Ir x X,|-| x 7) — R fulfil for a constant C > 0 
f(t,2) < C (1+ |x|) for all (t,x) € Ir x X. (4.10) 


Let f be bounded from below and let f be either lower semi-continuous or lower 
semicontinuous with respect to the second argument and satisfy a uniform 
continuity in time, i.e. for any n € N and given e > 0 there exists a ô(n, e€) 
with |f(x,t)— f(x, t')| < e for |t—-t'| < ô(n, €), t,t € Ir and all x € K? x KY. 
If one of these two conditions is met, it follows 


f(t,-) € L! (X, Bx, P+) for almost all t € Ir, (4.11) 
[0, T] > t= [He dP,(x) dt € R | € L([0,7)), (4.12) 
T 


imar fro Vix (t)) dvi, (t z f fn t,x) dPi(« (4.13) 


0 
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A continuous function f that is not necessarily bounded from below and sat- 


isfies (4.10) fulfils 


lim afr eau anit o= f frea PA (4.14) 
0 


We remind that a cylinder function is defined as follows: 


Definition 4.3. Let X be a topological vector space. A function f : X — R 
is called a cylinder function on X if for some p € N there exists a a € X*” 
and a g € Ca (R?) such that f has the representation f = goa. The symbol 
Zx denotes the set of all cylinder functions on X. 


Corollary 4.4. From the assumptions of Thm. 4.1 it follows 


Pt (x1 U Ky x rs) =0 for almost all t € Ir. (4.15) 


n=1 


The longer, technical proofs of this statement can be found in [Anr06]. Crucial 
is the following Lemma that is also proved in [Anr06]. 


Lemma 4.5. (Approximation Lemma.) Let X be a Banach space, f : X > R 
be strongly continuous, g : X — R be weakly continuous, K C X strongly 
compact and L C X weakly compact. Then for any e > 0 there exist cylinder 
functions [,: X — R and O; : X — R with 


max |f (2) — Fe(e)| < € and max |g(x) — Oe(2)| < e. 


Pe 2 z 
In addition it holds max |re(£)| < max |f(x)| and max |O.(x)| < max |g(x)| 


Remark. The approximating cylinder functions can be chosen as elements of 
a fixed countable set. 


Idea of proof: The strategy to prove Thm. 4.1 is to first show the statements 
in the finite-dimensional case using slicing theorems, [Ped04],, and then to 
approximate with cylinder functions. Essentially, the extension to the infinite- 
dimensional case is an application of Riesz’ representation theorem for positive 
linear forms on the space of continuous functions on completely regular spaces. 

The proofs of the corollaries rely on the fact that lower continuous func- 
tions can be approximated on compact sets from below by continuous func- 
tions, see [Anr06]. 


4.3 Measure valued solutions 


In earlier sections we have established the mathematical tools needed to formu- 
late the final theorem. The proof of the next theorem is contained in [Anr06]. 
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Theorem 4.6. The continuous system (2.4)-(2.6), (2.7)-(2.10), (2.13)-(2.15), 
(2.27) or (2.28), (2.29) or (2.80), (2.381) and (2.33) possesses a solution in 
the sense of Sect. 4.1 if the requirements of Thm. 3.1 and Corollary 3.2 are 
met. There exists p € L?((0,T] x Q,R”) with Pi = dae) x Pi for almost all 
t € [0,7] and the following energy inequality holds: 

T2 


E(t) — E (71) < - fw) + Q*(t) dt for almost all T1, T2 € [0, T], (4.16) 


TL 























where 
E: [0, T] > R, tro f AaS t dP, (x), (4.17) 
x 
Q: [0,7] > R, to fA dP, (2), (4.18) 
X 
Q* : [0, T] > R, to [OVa dP, (2). (4.19) 
x 


Idea of proof: (All subsequences are labelled as the original sequence; we 
use the abbreviation ĉ := (P, ®, $, J, /2).) 

For i € N let h(i) := Z. Then, according to Thm. 3.1 and Corollary 3.2, 
we can construct recursively a finite sequence (îm) o which solves the 
time-discrete problem. Now we define ĉ; for i € N as the step function cor- 
responding to the sequence which is continuous from left. One can show that 
(r: = (pi, &;, Ši, de, ju:)) x satisfies the assumptions of Thm. 4.1. In the next 

i€ 
step one proves with the help of Kolmogoroff’s compactness criterium, see 
[W1082], the existence of a subsequence with the property 


For a given smooth Dirac sequence (¢;);en there exists for every j € N a 
ĝi € L?((0, T] x 2, R”) such that 


k—oo 


T 
: N 2 
im f fea- p(t.2) dx dt = 0. 
0 2 


Furthermore we need: 


Lemma 4.7. Let X be a T3a-space, v E€ PM(X) with v(X) = 1 and g € 
C(X, X). If for every f E€ (X) 


[ Pl) ae) = [runa =0, (4.20) 
X 


then there exists xo E€ X with g(x) = xo for all x € suppv. 
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Applying Thm. 4.1 and the above Lemma on the subsequence (7;)ien, we find 
the existence of ô with P; = dat) x P, for almost all t € [0, T]. 

With the exception of (4.2) and (4.4), the remaining equations follow es- 
sentially from Thm. 4.1 and the Corollaries 4.2 and 4.4. 

For the proof of the minimality condition (4.4), we show with (4.1) that 


E(t)= min _ F(t,A(t),®,x) forte (0,7). 
(P,x)EX2x X3 


The validity of (4.2) relies on the subgradient-inequality 


Ip (5 — p) dy dP,(ê je p p) dy dP: (ê), 


where (4.4) is used. 
The proof of the energy inequality is based on the following considerations. 
Define for i € N, 1 < k < 2¢ and t := kh(i) 


Jẹ := argmin [F; (p;(t) + h(i)divJ, Ba, Xt) + R)QI)], 
JEL? (Q+, R3)” 
pint(t) := p(t) + hdiv Jim 
and the corresponding continuation for t 4 kh(i). Then it holds 


h(i) ui div I = p (pi — pi) > fy (PE, i) — H (pi, Bs) 


li 


and (Young’s inequality) 


Qi (12°) +Q (Vee, t) = J Vi A) dz 


2 


m 
= -5 fx ppt div Jif dx 
I=L, 


fori EN, 1< j <m, te [0, T] and « € 2. This yields 
F, (p aq t), Br Xt) — F (p(t J, Pt, Xt; ) 
< —h(i) [Q (JE") +Q; (Vpirt)] 
which can be rewritten as 
Fi (om (t ), r,,Xt;) ag h(i)Qi (Je) 
< Fy (pilt), Bt, Xt) — h(i LQ? (Vpin’) : 


Therefore, due to 
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Fina (Pilt + h(t), Barrio Xarri) + KO Qn (Jatra) 
< F (È (t), Bis, Xt) + h(i)Qi Ce) ’ 


we obtain the estimate 


Fipra) (Pilt + h(4)), Barri Xarri) — Fe (A E), Ge, Xt) 
< h(i) [Quno (Jarno) +Q (Vuir’)|. (4.21) 


Next, the inequality (4.21) is rewritten in terms of Ê ; Ô, Oint- Then we con- 
sider the two sequences 


(x := (Ôi, Di, Ris Ji, Îi, jit.) ; and (? = (ping; , Bi, Šis Ji, Îi, fits) ; 
iEN iEN 

and show that they generate the same measure Ď, = Oat) X P, for almost all 

t € [0, T]. From (4.21) it follows (4.16) with the corresponding Q#_,. Estimating 


the subgradient inequality we can finally show Qi, = Q*. 
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Summary. We address the numerical analysis of relaxed formulations for scalar 
and vectorial nonconvex variational problems originating from models for solid-solid 
phase transitions and crystal plasticity. We discuss algorithms for the approximation 
of the quasiconvex envelope using laminates, rank-one convexity, and polyconvexity, 
and present some numerical applications to benchmarks problems, and to a model 
for single-slip crystal plasticity. 


1 Introduction 


Variational models based on nonlinear elasticity, and their mathematical anal- 
ysis, have proved useful for the study of phase transitions and microstructures 
in elastic solids, starting with the seminal work of Ball & James [BJ87, BJ92]. 
The methods of relaxation have in some cases lead to new understandings 
on the mesoscopic phase diagram [DSD02], on the microscopic origin of com- 
plex domain patterns [KM94], and on geometrical conditions relevant for the 
design of new devices and materials [Bha03]. 
Mathematically, one minimizes the functional 


E(u) = f W (Du) dx (1.1) 
Q 
over the set of admissible deformations u : 2 C R” — R”, with u € 


W1?(Q;R™), u = up on OM, and W the energy density of the crystal. Here 
W?1?(Q;IR™) denotes standard Sobolev spaces with p € (1,00) related to the 
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growth of W, and (1.1) may include lower-order terms representing external 
forces. 

One says that the functional E predicts a microstructure if a minimum 
does not exist, and gradients of infimizing sequences exhibit oscillations on 
finer and finer scales. Objective of the research is the analysis and numerical 
simulation of those infimizing sequences and/or their most relevant features. 

The determination of low-energy states of such functionals E by standard 
finite element methods will typically yield mesh dependent results with oscil- 
lations in Dup on a length scale comparable with the mesh size. Further, the 
computations can be very sensitive to mesh orientation and miss completely 
the description of the real microstructural configuration. For instance, the 
precise characterization of the minimizers of a non-convex problem in [BP04] 
shows that they develop complicated branching structures and are therefore 
difficult to detect numerically. One is therefore interested in alternative ap- 
proaches, which do not attempt a direct numerical minimization, and is lead 
to the concept of relaxation. 

From a physical point of view, relaxation focuses on macroscopic features 
and on the average material behaviour, rather than on the details of the mi- 
crostructure. This means that one operates a separation of scales, and tries to 
extract from the microscale all information that is relevant for the macroscale, 
and no more. The macroscopic deformation is then determined by studying 
a problem which contains an effective energy density, which automatically 
accounts for the optimal local microstructure. 

From a mathematical point of view, relaxation theory aims to replace an 
ill posed problem with a well posed one (at least as far as existence is con- 
cerned), preserving the essential features of the original problem. This can 
be achieved following basically two approaches. The first option is to enlarge 
the class of the competing functions, allowing for measure-valued solutions 
fYou80, Ped97, Rou96]. The second one is to focus on the weak limits of 
infimizing sequences. Weak convergence, which qualitatively corresponds to 
convergence of averages, eliminates the fine-scale oscillations and gives a limit 
which only contains information on the macroscopic scale. The idea is, there- 
fore, to study the behaviour of infimizing sequences by characterizing their 
limit points as minimizers of a new functional [But89, Dac89, Mue99]. 

Lack of strong convergence of infimizing sequences, and lack of existence 
of a minimizer, is strictly related to the lack of weak lower semicontinuity 
of the functional (1.1) on the space W1?(Q;R™), which in turn is equiva- 
lent to quasiconvexity of W, under suitable continuity and growth conditions 
[Dac89, Mue99]. Precisely, if W is coercive then weak sequential lower semi- 
continuity (and hence quasiconvexity) is a sufficient condition for the existence 
of minimizers. If instead W is not quasiconvex, then one must expect fine-scale 
oscillations in the gradients of infimizing sequences. The relaxation of E(w) 
is achieved in this case by replacing W with its quasiconvex envelope W“%, 
the largest quasiconvex function bounded from above by W. Knowledge of 
W° would permit an accurate simulation of the macroscopic features of E. 
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Unfortunately, analytical formulas for the quasiconvex envelope are known 
only for very few energy densities W. Consequently, one is interested to nu- 
merical relaxation, which aims at an efficient approximation of W4°. This 
will be illustrated and discussed below, considering model examples of mi- 
crostructures in phase transitions and in elastoplasticity described by scalar 
and vector nonconvex variational problems. We call the minimization of (1.1) 
scalar ifn Am := min{n,m} = 1, vector otherwise. 

The remaining part of the paper is organized as follows. Sect. 2 analyzes 
generalized and relaxed formulations of a scalar nonconvex minimization prob- 
lem for a two-well energy density. Sect. 3 deals, instead, with nonconvex vector 
variational problems by introducing the notions of quasiconvexity, rank-one 
convexity, and polyconvexity. Sect. 4 describes numerical algorithms for the 
evaluation of the rank-one convex and polyconvex envelope as approximation 
of the quasiconvex envelope. Applications to models for microstructure in 
phase transitions and plasticity are given in Sect. 5. Finally, Sect. 6 concludes 
the paper with some observations. 





2 The scalar double-well problem and its relaxation 


We consider in this section the anti-plane shear simplification of the Ericksen- 
James energy density 


W(F):=|F-FiP|F-F|? for Fe R’, (2.1) 


with F,, Fh € R?, F = Du, and u : 2 C R? — R, as a typical example of a 
scalar nonconvex minimization problem. This reads as follows. 


Problem 2.1. Seek u € A that minimizes 
E(u) =| W (Du) dx +a f lu — f|? dz, (P) 
Q R 


over the set of admissible functions A := up + WẸ * (2; R), with up € 
W?+4(Q;R) prescribed, a > 0, and f € L?(Q;R). 


As discussed in the Introduction, direct minimization of (P) is difficult [Lus96]. 
The rest of this section discusses alternative approaches. Precisely, in Sect. 2.1 
we introduce the concept of Young measures, and in Subsections 2.2 and 2.3 
generalizations of (P) with Young measures (problem (GP)) and by convex- 
ification (problem (CP)) are discussed. Convergence of adaptive mesh refine- 
ment algorithms is discussed in the Sect. 2.4, whereas Sect. 2.5 summarizes 
the main results for the formulations (CP) and (GP) for an ad hoc extension 
to 2D of the broken Tartar problem [NW93] developed in [CJ03]. 
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2.1 Young measures capture oscillations 


Infimizing sequences (ue) for (P) are typically weakly but not strongly con- 
vergent in W1:?(Q;IR) and the corresponding weak limits are in general not 
solutions of (P), because of the lack of weak lower semicontinuity. Young 
measures provide the mathematical tool for representing the weak-» limit, 
whenever it exists, of sequences (f(ue)) C L®(Q; R) with f € Co(R; R). Here 
Co(R™;R) for m > 1 is the space of the functions f € C(IR™;R) such that 
lim)2|00f(#) = 0. Definitions and properties of Young measures are given 
next in relation to their use for sequences (ue) C L®(Q C R”; R™). 


Definition 2.2 ([Mue99]). Denote with M(R™) the set of all finite signed 
Radon measures supported in R™, and with LX (Q; M(R™)) the space of func- 
tions v = (Vz)veq defined in N C R” and with values in M(R™) such that 


(v; g) : R >R, z= (wmo) = f g dva 


are measurable for all g E€ Co(R™; R). Let YM(2;R™) be the set of all v € 
D°°(2; M(R™)) which are probability measures, i.e. vy > 0 and vs(R™)=1 
for almost all x € N. The elements of Y M(N; R™) are called Young measures. 


Theorem 2.3 (Existence theorem for Young measures [Mue99]). As- 
sume the sequence (ue) bounded in L®(N;R™). Then there exists a com- 
pact set K C R™, a subsequence (ue,;) C (ue), and a Young measure 
v = (Vz)2eq E LE (2; M(R™)) such that: 


(i) supp vz C K a.e. in 2 
(ti) for each f € Co(R™;R) we have 


f f (ue, )h dx > | fhdx for every h € L'(Q;R), (2.2) 
Q Q 


where 


f(a) = a3 f) = fdv, for a.e. x E€ NQ. (2.3) 
R™ 
Definition 2.4. We call v = (vz)zeq in Thm. 2.3 the Young measure asso- 
ciated with (or generated by) the sequence (ue,). 


Remark 2.5. (i) From Thm. 2.3 one obtains a criterion for strong convergence, 
and consequently a criterion to decide on the occurrence or not of oscillations. 
Given up Š u in L® (Q; R”), then ue > u strongly in L?(Q;R™) with p < co 
if and only if vz = 6,2) a.e. in Q [Mue99]. 

(ii) By making specific choices for f, we can read off certain information 
regarding the structure of the Young measures. For instance, if ug > u in 
L®(N; R”) and f = id in a neighbourhood of supp v, then 
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u(x) = Adv,(X) 


R™ 
where (Vz)zeq is the Young measure associated with (uz). 


Since microstructures are associated with oscillations in the gradients of in- 
fimizing sequences, one is mainly interested in understanding what are the 
Young measures associated with the sequence (Dug). These are called gradi- 
ent Young measures. 


Definition 2.6 ([Mue99]). An element v € L(2;M(R™)) is called a 
W!~-gradient Young measure generated by (ue) if it is the Young measure 
generated by the sequence of gradients (Due) with (ue) weakly-* convergent in 
Wi°(2;R™). 


Remark 2.7. (i) From eq. (2.2) with f = id in a neighbourhood of supp v, the 
weak-x limit Du of a sequence of gradients (Due) for an infimizing sequence 
(ue) for (P) is related to the gradient Young measure generated by (ue) by 


Du(x) = Í Fdvs(F) = (vz;id}) a.e. in 2. (2.4) 
R2 
Therefore the gradient Young measure v permits to compute the macro- 


scopic strain Du. Analogously, specifying f = DW around supp v in (2.2), if 
(DW (Due)) is weakly-* convergent to some o, one obtains 


a(x) = DW (F)dvz(F) = (vg; DW) ae. in Q. (2.5) 
R2 


(ii) Specifying then f = W around supp v in (2.2) one has 


lim | W(Dug) dx = f (Vz; W) dz. (2.6) 
law jQ Q 


2.2 Relaxation with Young measures and their numerical 
approximation 


Equation (2.6) along with (2.4) motivate the following generalized problem. 


Problem 2.8. Seek a minimizer (u,v) € B of 
GE(u,v) =f (vz, W) acta f lu — f|? dx (GP) 
R Q 


over B := { (u,v) € Ax YM(Q;R?) : Du(x) = (vz; id) for ae. x € R}. 


The relevance of problem (GP) follows from relaxation theory. 
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Theorem 2.9 ([Rou96, Thm. 5.2.1][Ped97, Thm. 4.4]). 
Problem (GP) has a solution and there holds 

inf E(u) = min GE(u,v). 

ucA (u,v)EB 
Moreover, if (ue) is a weakly convergent infimizing sequence for (P), with 
weak limit u, that generates the gradient Young measure v, then (u,v) is a 
minimizer for (GP). Vice versa, if (u,v) is a solution of (GP) then there is 
a weakly convergent infimizing sequence (ue) such that its weak limit is u and 
v is the Young measured generated by (ue). 


Remark 2.10. Given v € LX (N; M(R?), the compatibility condition (v,;id) = 
Du(x) with u € W'°(Q;R), and suppv, C K compact subset of R?, char- 
acterize in the scalar case completely the gradient Young measures associated 
with sequences. 


Numerical approximations of (GP) have been proposed in [NW93, CR98, 
Rou96a, Ped95, KMRO05]. Those involve a discretization of the admissible set 
A x Y M(Q;IR?) and care of the differential constraint. 

Within a finite element scheme, denote with 7 a regular triangulation 
of 2, and by E and N the set of all edges and vertices, respectively. Then 
introduce the following finite dimensional spaces 


S'(T) :={v € C(Q): VT € T, vplz is affine}, 
Si (T) ={fun € S'(T) : vn = 0 on ON}. 


Let K := NNO denote the set of free nodes, the Dirichlet boundary conditions 
up are discretized by nodal interpolation, i.e. up,, € S'(T) with 


up,n(z) = up(z) if z € K and up p(z) =O0if zEN\K. 


A conforming finite element method of (GP) is obtained by replacing the space 
A with An := up,n + S(T) whereas the set of Young measures Y M(Q; R?) 
is approximated by element-wise constant measures, i.e. homogeneous Young 
measures Vr expressed as a convex combination of Dirac measures supported 
at the nodes of a triangulation 7 of a convex polygonal domain w C R? with 
mesh size d. That is, denote by Na(w) the set of nodes of the triangulation 7 
of w, we assume 

VT,d = >D ary OF rp; (2.7) 

Fr,3€Na(w) 


with known atoms Fr; € Na(w), and unknown coefficients ar,;. We denote 
this set with £°(T;PM)p,a) where PMn a is the set of probability measures 
expressed as in (2.7). Consider the set 


Bria = {(Un, Und) E An X LUE: PManh,a) : YT € T, Dop|r = (un,alT; id)}, 
(2.8) 
the discrete generalized problem reads 
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Problem 2.11. Seek (un, Vh,a) E€ Bria such that 
Minimize GE(un, Vha) over Bh, a- (GPp,a) 


An existence result for (GP),,a) follows as for (GP). Let £°(T;R?) be the 
set of piecewise constant functions on 7 with values in R?, W** the convex 
envelope of W (defined in Sect. 2.3) and WS = (P+W )** the convex envelope 
of P W, the nodal interpolation of W associated with the triangulation 7 of 
w. Let o = DW**(Du) for a solution u € A of (CP) (see Problem 2.13), then 
we have the following a-priori and a-posteriori error bounds 


Theorem 2.12 ([Bar04, Thm. 4.6 & Thm. 4.8]). Assume u € A solution 
of (CP), (tn, Vha) € Bn a solution of (GPh a), and Ana € L?(T; R?) the 
Lagrange multiplier associated with the constraint Dup|r = (un alr;id). Then, 
there holds 


lo — Anall <C inf (||V(u— va) + allu — vall) 
vnE€An 
+ CIOWE — DW** lret); (2.9) 


lo = Anal? < C4 CZ RNA + divAna + 2alf = un) ID? 
TET 





+ (30 heln a: nall?) + OWS - DW™ lre 
REE 
+ [nè akun/ðslaro} | (2.10) 


Since ||OWS — DW**|| can be bounded from above in terms of grid size d 
and D?W**, together with the density of the finite element spaces in A, from 
(2.9) one proves àn ,a > o in L? as h, d > 0, whereas (2.10) represents a basic 
ingredient of the multilevel adaptive scheme for the definition of the support 
of the Young measures developed by [Bar04]. 


2.3 Relaxation via convex envelopes 


By minimizing the two contributions in (GP) separately one obtains another 
relaxation of (P). For fixed F = Du one can find a probability measure 
V = (Vz)eeq such that v minimizes the expression (u;W) among all the 
probability measures pu satisfying (u; id) = F. In some cases, it is also possible 
to obtain an explicit expression for the convex hull of W, defined by 

W**(F)= min (u,W). (2.11) 

HEY M(Q;R2) 
(wid) =F 
The notation is motivated by the fact that for continuous W the convex 
envelope coincides with the bipolar function. We recall that, since we are in 
the scalar case, convexity and quasiconvexity coincide. 
Problem (GP) and (2.11) motivate to consider the convexified problem. 
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Problem 2.13. Seek u € A that minimizes 
E(u) = W**(Du) deta f lu — f|? dz. (CP) 
R R 


Likewise problem (GP), the importance of problem (CP) follows from 
relaxation theory. 


Theorem 2.14 ([Dac89]). Problem (CP) has a solution and there holds 


inf E(u) = minE® (u). (2.12) 
ucA ucA 
Moreover, if (ue) is a weakly convergent minimizing sequence of (P) and u is 
its weak limit, then u is a solution of (CP). Vice versa, if u is a solution of 
(CP) then there exists a weakly convergent minimizing sequence of (P) having 


u as weak limit. The stress field o = DW**(u) is unique and independent of 
u among the solutions of (CP) [Fri94/. 


Remark 2.15. Whenever a > 0 in (P), problem (CP) admits a unique solution 
u. If a = 0, however, the numerical treatment of (CP) requires the introduc- 
tion of a perturbation in EF, usually in the form of a strictly convex functional 
scaled by a small quantity. The introduction of the stabilized term finds its 
justification in the need of including some kind of ‘selection mechanism’ in 
the model which (i) ensures uniqueness on discrete level, (ii) is necessary for 
the design of convergent iterative solvers [BCHH04], and (iii) with some sta- 
bilizations terms for standard low-order finite element methods yields strong 
convergence of the gradients [BCPP04]. The same happens when dealing with 
quasiconvex envelopes in the vectorial case, see, e.g., [CDD02]. 


2.4 Adaptive finite element methods for relaxed formulations 
An h-finite element adaptive algorithm consists of successive loops of the form 
SOLVE — ESTIMATE — MARK — REFINE (2.13) 


designed to produce with less computational effort more efficient meshes by 
targeted local refinements. The use of such algorithms for the direct finite ele- 
ment minimization, however, does not always lead to an improved convergence 
rate in the stress error and also unclear is its convergence. For degenerately 
convex problems with Ct energy density W characterized for some constants 
p,r,s by the conditions 


|DW(A) — DW(B)|" < c(1 + |A|? + |Bl*)(W(B) — W(A) 
—DW(A) -(B—A)), (2.14) 


a(l AP — 1) < W(A) < e(l AP +1), 


to hold for all A, B € R”, [Car06] proves the convergence of the stress fields 
T0, 01, ... produced by (2.13) to o = DW**(Du) in L"/@+ts/P)(9; R?). In the 
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algorithm (2.13), the step MARK is realized by the criterion introduced by 
[Dor96] where one marks the edges E € M C E such that O Dpce nh < 
Xerem n% with ng the edge contribution to the global error estimator. In the 
step REFINE , on the other hand, one refines each triangle T with an edge 
in M such that an inner node is created, with possible further refinements 
that guarantee that ||; D f||L2(2;r2) tends to zero as j — oo and the resulting 


triangulation is regular. 


2.5 A 2D scalar benchmark problem 


In this section we report on the analysis of (P) in the particular case of 
a = 1, f and up given in [CJ03], 2 = (0,1) x (0,3/2), and the two wells 
F, := —(3,2)/V13 and F> = —F,. The convex envelope W** was computed 
in [CP97], and is 


W** (F) = (IF? — 14)’ + 4(FP — (3,2) - F)*) (2.15) 





with (-)4 := max{0,-} and the symbol - the inner product in R?. From relax- 
ation theory, we have the following result. 


Theorem 2.16 ([CJ03]). The problem (CP) has a unique solution u € A 


we) = min E**(v) = E**(u), (2.16) 


characterized as the solution of the Euler-Lagrange equation 
i) o- Duda+ 2 | (u — f)udx =0 for all v € Wo'*(2;R), (2.17) 
R R 


where o := DW**(Du). Furthermore, any infimizing sequence (ue) of (P) is 
bounded in W:4(Q;R) and generates a sequence of stresses og := DW (Dug) 
convergent in measure toward o = DW**(Du). 


For this problem, moreover, one obtains an analytical expression for the 
gradient Young measure which is unique and is given by 








Ve = A(F)s (r) + (1 — A(F))ds_ (F), (2.18) 
with F = Du and 
1 
A) = pU P |PF|?)~*/2) € [0,1], (2.19) 
PF + Po(1 — |PF|?)~1/? for |F| < 1, 
S+(F) = (2.20) 
F for 1 < |F], 


where P = I — F; ® F> (with @ tensor product of vectors of R°). 
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Remark 2.17. Since og converges toward o, from (2.5) the stress field o = 
DW**(Du) can then be represented as 


a(x) = DW diz, (2.21) 
R2 
with v given in (2.18) [Fri94]. 
With the notation of Sect. 2.2 the Galerkin discretization of (2.17) reads 
Problem 2.18. Seek up € Apn such that 


f on: Dup dx + 2 | (un — f)un dx = 0 for all vn € Sà (T) (CP) 
R Q 


with on := DW**(Dup) 


Strong convergence in L*/3(Q;R?) of the stress fields op results from the a 
priori error estimate [CP97] 


lo — On || 14/3(2;R2) < CA A — Vnllw24(0;R): (2.22) 


This is obtained using the condition 


|DW**(A)—DW**(B)|? < c(1+|A|?+|B|?)(DW**(B)—DW**(A)) : (B—A) 

(2.23) 
that holds for any A, B € R? together with some p = 4 and q = 3 growth con- 
ditions on W and on DW**, respectively. Another application of (2.23) also 
shows the reliability of residual and averaging based error estimates whereas 
the efficiency follows from standard arguments; that is, one has also 


conm — h.o.t. < |lo — onl r4/s(0,R2) S con? + h.o.t. (2.24) 


The minimal averaging error estimator ny that enters (2.24) is defined by 


3/4 m 
nm = (>. me)? for m = llon — of ll za/scrsmzy; 
TET 


with o* € S'(T)? that minimizes 
llon = Th || L4/3(2;R?) among Tp €E Sty: 


Figure 2.1 displays experimental convergence rates for ||o — op || 14/3(.2;R2) 


and the error estimators nj and mye for uniform and adaptive mesh refine- 


ment. The adaptive refinement strategy leads to significantly reduced error 
and improved experimental convergence rates. 


Remark 2.19. The two-sided estimates (2.24) shows that lower bounds are no 
valid upper bounds and vice versa, due to the different exponents for nas 
in the reliability and efficiency estimate. This miss balance is referred to as 
reliability-efficiency gap [CJ03]. 
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Fig. 2.1. The 2D benchmark problem. Experimental convergence rates for ||o — 


Tn || 14/3 (2R) and the error estimators ni and mi? plotted against degrees of free- 
dom N with a logarithmic scale for uniform and adaptive mesh refinement. 


3 Nonconvex vector variational problems 


For scalar nonconvex variational problems the convexity of W(Du) with re- 
spect to F = Du ae the weak (weak-«) sequential lower semicontinuity 
of the functional E(u) = fe W(Du) on Wt?(Q;R™) for 1 < p < œ (resp. 
p = œ). Along mn lee Ka conditions on W, one can prove the ex- 
istence of minimizers using the direct method of the calculus of variations. In 
the vectorial case a weaker condition is sufficient, namely, quasiconvexity. 


3.1 Quasiconvexity and effective energy density 


Quasiconvexity was introduced by Morrey in 1952 as a condition on the energy 
density W which is equivalent, under appropriate growth conditions, to weak 
sequential lower semicontinuity of the functional Æ [Mor52]. 


Definition 3.1. Given a function W : R?*3 — R, we say that W is quasi- 

conver at F € R°*3 if for every open and bounded set w C RÌ one has 

f W(F + Dy(x)) dx > i W(F) dz = |w|W(F) for each y € Wo’? (w; R?). 
i a (3.1) 


kar | lies at the heart of the relaxation theory for functionals of 
the type E(u) = fa W o W(Du) dz. An instrumental role is played by the quasi- 
convex mee of W ee as the pointwise supremum of the quasiconvex 
functions that are bounded from above by W, i.e., for each F € R°*?, 


W*(F) = sup {f(F) : f < W with f quasiconvex}. (3.2) 
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Under suitable growth conditions on W, the weakly (weakly-* for p = co) 
sequentially lower semicontinuous envelope of the functional 


Pas f W (Du) de + L(w), (3.3) 
R 
has the following integral representation 
E(u) = | W (Du) dx + L(u). (3.4) 
Q 


Here £(u) is a linear term representing external forces. 
The link between the minimization of (3.3) and (3.4) is given by relaxation 
theory. 


Theorem 3.2 ([Dac89, Mue99]). Let up € W1?(Q;R°) be fixed, A = 
up + Wo’? (Q;R3), and assume W to have p-growth and be p-coercive. Then 
the relaxed problem 


Minimize ET (u) amongst u € A, (QP) 
has a solution and there holds 


min E(u) = U (3.5) 


Furthermore, any solution u of (QP) is the weak limit of an infimizing se- 
quence for (3.3). 


The quasiconvex envelope of W at F can be characterized equivalently as 
[Dac89, Mue99] 


W%(F) = W(Dy(«)) de. (3.6) 


1 
inf — 
yew, (w;R3) jw] O 
y=Fea on ðw 


Inequality (3.1) states that the deformation u(x) = Fx is a minimizer of 
Jo W(Dy) da subject to its own boundary values. As such W4°(F) represents 
the infimum of the average energy taken over all possible microstructures 
y = y(x) that satisfy the boundary condition y(x) = Fx on 02, with the 
least energy achieved by the deformation y = Fz itself. 


Remark 3.3. (i) For n > 2, m > 3 it has been shown in [Kri99] that there does 
not exist a local characterization of (3.1), that is, there is no set of inequalities 
on W and its derivatives at an arbitrary matrix F which is necessary and suf- 
ficient for W to be quasiconvex. As a result, quasiconvexity is a very difficult 
property to verify in practice. Only few examples of analytical expressions of 
quasiconvex envelopes of particular functions are known with notable exam- 
ples reported in [KS86, Koh91, DSD02, CT05, CO05]. 
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(ii) The generalized formulation with gradient Young measures does not cir- 
cumvent the quasiconvexification. The set of admissible gradient Young mea- 
sures, besides the conditions listed in Remark 2.10, is characterized by the 
fact that Jensen’s inequality should hold for any quasiconvex function, i.e. 
[KP91] 


f((vz;id}) < (Vz; f) a.e. x € R, for all quasiconvex functions f. (3.7) 


For the constructive characterization and evaluation of W4° for general W 
one is, therefore, faced with a direct minimization of a nonconvex functional 
with linear boundary conditions and no lower order terms on an arbitrary 
domain w, whose solution may be, however, very difficult to tackle with. Nec- 
essary or sufficient conditions for quasiconvexity have been, therefore, intro- 
duced providing some insight for the analysis of microstructures. 


3.2 Rank-one convexity and laminated microstructures 


A necessary condition for quasiconvexity is rank-one convexity, stating con- 
vexity of the function W along all rank-one directions. 


Definition 3.4. A function W : R°*3 — R is rank-one convex if for all 
A, B € R°*3 such that rank(A — B) < 1 and all A € (0,1), 


WOA + (1-—A)B) < AW(A) + (1 — A)W (B). (3.8) 


Equivalently, W is rank-one convex if for all A € R°*°? and all a, n € R? the 
function A= W(A+ àa 8 n) is conver on R. This is in turn equivalent to 


W(A+a89n) <AW(A+a@n)+(1—A)W(A), (3.9) 
for all X € (0,1), and all A, a and n. 


The following considerations illustrate the relevance of rank-one convexity in 
the analysis of microstructures. By letting y € W'°(Q;R°) with y(x) = Fx 
on Ow, W9°(F) provides a macroscopic description of all possible microstruc- 
tures with average deformation F. In the evaluation of the infimum (3.6) it 
may be convenient to restrict y = y(x) to a subclass of W4(Q; R?) corre- 
sponding only to certain microstructure patterns. For example, one can con- 
sider the deformations ye = ye(x) describing first order laminates, and with 
ye(x) = Fa on Ow. The corresponding sequence of gradients will, therefore, 
oscillate between two phases 


Fo=F+(1-A)a@nand Fy = F— agn (3.10) 


with some a,n € R°*3, A € (0,1), and Fo — Fı =a @ n. The gradient Young 
measure associated with (ye) will be homogeneous and equal to 


v =n + (1 — AJF,- 
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wl | a/l Fo O 
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Fig. 3.1. (a) Microstructural patterns in first order laminates. (b) Graph represen- 
tation. 




















For such infimizing sequences (y¢) one has [Dac89, Mue99] 


ih — | W (Dye(x)) dx = \W(Fo) + (1 — )W(Fi). (3.11) 


lo |w] 


In the class of the first order laminates defined by (3.10), those that realize 
the lowest energetic content will therefore be solution of the problem 


ROW (F) =i = = = : 
(F) = inf{AW (F + (1 — àJa 8 n) + (1 — A)W (F — àa Q n)) 
Po Fi (3.12) 
0<A<landa,n€ R*}. 


If A= 0 or A = 1 then no microstructure will occur. The graphical interpre- 
tation of condition (3.10) and the corresponding microstructure pattern are 
depicted in Fig. 3.1. 

For Fo and F; given as above, consider the convex combination 


Fo = Ao Foo + (1 = Xo) Fou and Fy = 1 Fio + (1 = A) Fin, (3.13) 
with 
Foo — Foi = ao ® No and Fio — Fin =a, Q Nı. (3.14) 
By replacing (3.13) into (3.10) one obtains 


F = Ao Foo + A(1 a Xo) For + (1 — A)Ai Fio + (1 = A)(1 E Mi) Fir. (3.15) 


The graphical interpretation of this decomposition and the corresponding mi- 
crostructure pattern are shown in Fig. 3.2. 

Microstructures defined by (3.15) are called second order laminates. One 
can therefore inquire on the second order laminates (if they exist) that min- 
imize 1/|w| f, W(Du) dx. Those will be solution of the following global non- 
linear optimization problem 


Relaxation and the Computation of Microstructures 211 


o 
A F 
Fio 
WÈ : 
4 À 1—À 
= ee 1 F F 
og 1 
A Pa I M a 
a 


aw Foo Fo: F40 F11 






















(a) (b) 


Fig. 3.2. (a) Microstructural patterns in second order laminates. (b) Graph repre- 
sentation. 


RËW(F) = mef AW Fao) + X(1 — ào) W (For) + (1 — A)àÀ1 W (Fio)+ 


(=A) (1—1) W (Fi) : 0< A, ào,à1 < 1, and 


Q, N, Ag, NO, Q1, N1 E€ re}. 
(3.16) 
It can be shown that it is also 


ROW (F) = inf{AROW (Fo) + (1 —- AROW (Fi): 0<A<1, ane R*} 
(3.17) 


with Fo, Fı defined as in (3.10). The iteration of (3.17) produces laminates 
of order k € N such that [KS86] 


W'(F) = lim R®W(F), (3.18) 
and there holds 
W < W<.. < ROW <- < ROW < ROW <W. (3.19) 
In (3.18), W”° denotes the rank-one convex envelope of W defined by (3.2) 
with rank-one convex functions. 
3.3 A lower bound to W%: polyconvex envelope 


Polyconvexity was introduced by Ball in [Bal77] as a structural condition on 
W compatible with some physical requirements that simple convexity would 
violate, and that was sufficient to ensure existence of minimizers for nonlinear 
finite strain elasticity. Both polyconvexity and convexity provide sufficient 
conditions for quasiconvexity. 
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Definition 3.5 ([Bal77]). The function W : R3*3 — R is polyconvex if there 
exists a convex function g : R°*? x R3*3 x R — R such that 


W(F) = 9(T(F)) for each F € RÌ”. (3.20) 
Here 
T : F € R?” — T(F) = (F, cof F, det F) € B®? x RX? xR. (3.21) 


The function g is not defined uniquely from W. Using Carathéodory theorem 
it can be shown that one possible choice is [Dac89] 


19 
ATF) = inf 00 AiW(Ai) Ag > oN 1, 
AGER i i= 


(3.22) 
19 
DATA) = T(F)}. 


The value of the polyconvex envelope W?° at F € R3*3, can be, therefore, 
characterized equivalently as solution of the following minimization problem. 


19 
W?*(F) = af. Zawa N20) A51, 
Aj ER i i= 


(3.23) 
19 
D MT (Ai) = T(F)}. 


The semiconvex notions introduced so far reduce to convexity in the scalar 
case, whereas in the vector case, their relation is represented in the following 
diagram 


W convex = W polyconvex = W quasiconvex = W rank-one convex, 
(3.24) 
with the converse not holding in general [Mue99]. In view of (3.24) one has 


We < Wre < WI < W" < -< ROW < -< ROW < ROW < W, 
(3.25) 
with W?* and W"* providing lower and upper bound to W49°, respectively. 
From Thm. 3.2 it follows 





miy f W (Du) dz = atf, W™(Du)d = int f R®) 
ucA uEA Jo 


= int, f, ROW (Du) dz = ait W (Du) dz. 
(3.26) 
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4 Numerical relaxation 


The evaluation of rank—one convex and polyconvex envelopes for a character- 
ization of the quasiconvex hull is an extremely complex task since the energy 
density W is defined on four- or nine-dimensional matrix spaces in the space 
dimension n = m = 2 or n = m = 3 but can be reduced using invariance un- 
der rotations. Furthermore, the definition of an envelope is typically not local, 
that is, the value at a given F € R"*” depends, in general, on the values of 
W on the whole space R”*” and not just on a bounded neighborhood of F. 
In view of the difficulty involved in checking analytically the previous 
notions, one tries to resort to efficient numerical algorithms for the approxi- 
mation to rank—one convex and polyconvex envelope, referred to as numerical 
relaxation, exploiting growth conditions and qualitative properties of W. 


4.1 Numerical polyconvexification 


For F € R°*? the value of the polyconvex envelope at F, W°(F), given by 
eq. (3.23) involves a nonlinear optimization problem whose solution may be 
very difficult. Given a finite set of nodes Ms,r = 6Z°*?1 B,.(0) C R°*3, 6 mesh 
size such that 0 < 6 < r and r large enough so that F € co Nor; convex hull 
of N5,r, an approximation to W?*(F’) can be obtained by solving the following 
linear optimization problem over the space R#“6." with #N5,r the cardinality 
of the discrete set N5,,. 





WEF) = int, S D OWA 28420, D oam 
A AENS,r AENS,r 
(4.1) 


>X @aT(A) = r(e} 


AENS,r 


Under the assumption that W € CÈS (R?*3; R) with a € [0, 1] [Bar04a] shows 
that there exists r’ < r such that the following estimates holds 


WEF) — WPF) < dF W orac (4.2) 


0)) 


obtained by constructing a continuous piecewise multilinear approximation to 
W?*. Furthermore, let \¥,, € R!° denote the Lagrangian multiplier associated 
with the constraints 


S(04A=F, S OacofA = cofF, and X` 9adetA = det F. (4.3) 


If additionally a > 0 and W?* € CE9 (R3*3; R) then an approximation to o := 
DW?*(F) is given by A§,.0 DT(F), where DT(F) is the Gateaux derivative 
of T, and o denotes the composition operator between Af, € L(R'9;R) and 
DT € L(R?; R!?) (with L(R™; R”) space of linear operators of R” into R”). 
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The solution of (4.1) involves a large number of unknowns equal to the 
cardinality of the discrete set M5,-. The combination of an active set strategy 
with local grid refinement and coarsening to avoid checking a Weierstrass-type 
maximum principle in all the nodes of Ms,- leads to a very efficient but still 
reliable algorithm that computes WẸ%.(F') [Bar04al. 


4.2 Numerical finite lamination 


Approximations to the rank-one convex envelope W"* can be realized by 
RW by iterating the construction described in Sect. 3 and motivated by 
the condition (3.18). The algorithm proposed by [Dol99, D0103], on the other 
hand, performs convexification along rank-one directions until the function 
is stable under this operation. A pseudo-algorithm for the approximation of 
W’°° would therefore have the following main ingredients: 


Algorithm 4.1 (Numerical lamination) 

(a)k=0; ROW = W. 

(b) For certain F, and for a,n € R3, g(t) =convexify RW (F + ta 8n). 
(c) R®+DW(F) = g(0) and compare with R“ W(F) to stop, otherwise set 
k =k+1 and go to (b). 


Approximations are, therefore, introduced in step (a), by restricting the 
space R?** where to evaluate W, and in step (b) where only discrete set of 
rank-one directions will be considered. 

With the notation of Sect. 4.1, introduce the discrete set of rank-one di- 
rections 





= {6RER**?: R=a®n, with a,n € Z?}, 


and for R € R} the following set lr := {l € Z : F + €6R € coNs,,}. For 
assigned R € R}, the elements of lr, 5 identify the intersection of the grid 
coN5,, with the direction F + tR. In step (a) of the Algorithm 4.1, one set 
ROW = I5,-W as nodal interpolation of W in cos, whereas at step (c) one 
solves the following optimization problem 


RO W(F) = inf inf { ORY W(F+OER) : 6¢>0, Z Oe = iy, 
, RERS OER*!R.S leLR 5 lelR,s 


with RO W := œ in R®*? and nodal interpolation of Ry 0) W in coN5,r at 
variance of the algorithm proposed in [D0199]. 

Assuming W € Ct1(R?*3; R) and equal to W”° in R°*3\ B, (0) with some 
r, bounds on a and b in the definition of R}, and that there exists a lamination 
level L such that RW = W’°, [Bar04b] improves the estimate of [DW00] 





[RE OW — WT l| 12 (con's nik) < cô. (4.4) 


Even if one does not know L and r, Rey provides, however, an upper 
bound to W"* for all k > 0, r > ô > 0 and F € coN5,r- 
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5 Phase transitions and plasticity as vector nonconvex 
minimization problems 


This section discusses the numerical analysis and approximation of relaxed 
formulations for two types of nonconvex vector stored energy densities. In 
the first example the quasiconvex envelope is known whereas for the other 
one no analytical expression of any semiconvex envelope is available. In the 
latter case, therefore, we proceed to numerical relaxation by computing the 
polyconvex and lamination convex envelope. 


5.1 Compatible phase transitions in elastic solids 
We consider a solid with two phases, whose energy density takes the form 
W(F) = min {W1 (F), W2(F)}. (5.1) 


In a geometrically linear context, the energy of each phase is 
1 
W; = 5 C(F- Fj): (F — F3), (5.2) 


where C is the linear elasticity tensor, the symbol : the inner product in 
R”*”, n = 2,3, and F} the stress-free configuration of phase j. 

Since W is not rank-one convex, thus neither quasiconvex, the functional 
E in (3.3) is not sequentially weakly lower semicontinuous. Assuming that 
the two wells F} and F> are rank-one connected, then there exists an affine 
function that equals W at the two wells and is elsewhere a strict lower bound 
of W and, therefore, there is no attainment of minimizer. 

The quasiconvex envelope of W is given by [Koh91] 


W2(F) if Wo(F) +7 < Wi(F), 


| edn) +a) 
Wweay 


> 7 if |Wo(F)-Wi(F)| < y, (5.3) 
Fi (W2(F) — Wi(F)) =] 
Y 





Wi (F) if Wi(F) +7 < Wa(F), 


with y = 1/2(F> — Fi, C(F> — Fı)) for rank-one connected wells. In this case, 
W“ belongs to C1(R”*”; R) and is convex. Further, from a result of [CP00], 
one can show that the following conditions hold true for W4° and are, in fact, 
equivalent [HL93, CHO06] 


IDW“ (E) — DW (F)| < LIE— FI, (5.4) 
= |DW*(B) — DW®(F)|? < (DW%®(E) — DW*(F)):(E-F), (5.5) 


=7|DW*(B) — DW*(F)?? < W%(E) —W%(F) — DW*(F): (E — F), 
(5.6) 
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for any E, F € R"*”. Given the functional 


H(u) = [weetwyars | fude+ llulZ2cae) (5.7) 
with e(u) = sym Du, using (5.4)—(5.6), and the following condition 


W%(E) -W%(F) — DW” (F) : (E—F) < (DW (F)— DW®(E)) : (F — E) 


(5.8) 
that holds for any E,F € R”*” for the convexity of W%, [CHO06] prove 
the convergence of (2.13) for the minimization of (5.7) over A := up + 


Wo °(Q;R”) and the preasymptotic convergence rate of the energy. More 
precisely, let ôn := H(un) — H(u), with u and u, minimizers of H over A and 
Ahn, respectively. Then, there holds 


be + llo — oellecorxn + llu — wellZ2¢asmny) < C((5e — be41)"”? + osce) (5.9) 


with o := DW%(e(u)), C > 0 depending on the mesh regularity and material 
parameters, and oscg a node-patchwise definition of the data oscillations. The 
observation that (He) is a Cauchy sequence yields, finally, that 


on >o in L?(2;R"*"), and ur — u in L7(0;R"), (5.10) 


provided that one controls also the data oscillations. 


5.2 Single-slip elastoplasticity 


We consider here a simplified model for plastic deformation in ductile single 
crystals. We focus on two spatial dimensions, and on the case that only a sin- 
gle slip system is active, which is described by an orthonormal pair of vectors 
s and n, with s € S! (where S! = {x € R? : |x| = 1}) the slip direction on the 
slip plane and n € S! the normal to the slip plane. In a geometrically nonlin- 
ear context, we assume the multiplicative decomposition of the deformation 
gradient F = Fe F, with Fp = I + ys 8n, where y € R is referred to as plastic 
slip. Hardening is included through a single internal variable p € R. Within 
the framework of rate-independent processes [Mie03, Mie04a, Mie04b, Mie05}, 
we consider monotonic loading, or equivalently the first time step in a time- 
discrete scheme, and, set equal to zero the initial values of the internal vari- 
ables (y, p). Minimizing out locally the internal variables leads to a variational 
formulation analogous to (1.1), which can again be analysed using the dis- 
cussed methods of the calculus of variations. The analogy with the study of 
martensitic microstructures via continuum models based on nonlinear elastic- 
ity, and the study via a variational problem expressed only in terms of the 
deformation gradient F = Dd, was advanced for the first time in [OR99]. 
The constitutive behaviour of the single crystal can be described in terms of 
two potentials: the free energy density W (Fe, p) and the dissipation potential 
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J(y,p). The free energy density is sum of an elastic and a plastic contribution 
as follows 

W (Fe, p) = We( Fe) + Wp(p), (5.11) 
with 


We(F.) = UCF.) + Ë (F.P - 2), Wo) =p, (6.12) 


and U (Fe) a polyconvex function defining a Neo-Hookian material, such as 





= (det Fe)? Ay 


+oo else, 


2 
a É log (det F.) if detF. > 0 


U(F.) = (5.13) 


with u, k material constants and h the hardening moduli. The dissipation 
potential J(y,p) is 


Terly| if ly|+p <0 
J(y,p) = (5.14) 
oo else , 
with Ter the critical shear stress. This is the same model considered in [CHM02, 
BCHH04, MLG04]. 
For this particular example, by minimizing with respect to the internal 
variables (y, p), we obtain a closed form of the condensed energy Weona(F’) as 


(mazx(0, u|C's -n| — Ter))? 


H 2 1 
con F) = F SAER , 
Weona(F) = UCF) + (PP - 2) - 5 EE A 


(5.15) 


with C = FTF. The energy density (5.15) is not rank-one convex and, hence, 
not quasiconvex. As a result, one may expect non attainment of minimizers for 
the corresponding functional, and developments of oscillations in the gradients 
of low-energy deformations. For the case under consideration, the occurrence 
of such microstructures can be shown by a direct finite element simulation 
using representative volume elements under periodic boundary conditions, cf. 
[HH02]. Figure 5.1 shows two typical results of these simulations: Oscillations 
in the plastic slip field y, forming first and second order laminates. These 
oscillations are highly mesh-dependent with the number of oscillations growing 
towards infinity when the mesh becomes finer and finer. 

The macroscopic material behaviour can be, however, understood by min- 
imizing out locally the possible microstructures and defining the quasiconvex 
envelope of Weona. Unfortunately, a closed form for condensed energies of the 
kind of Weona is known only in few simplified cases [Con03, CT05, CO05). 
We therefore resort to an approximation to the rank-one convex envelope 
Were al F) (Sect. 4) based on laminates. 

Let a, b € S! with a = (cosa, sina) and b = (cos 2, sin 8), then all the rank 
one matrices can be expressed in R?*? as pa @ b for a, 3, p € R. Considering 
first order laminates, the average energy is given by 
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(6) 


Fig. 5.1. Single-slip plasticity (a) First order laminates and (b) Second order lami- 
nates as from (3.12) and (3.16) respectively, assuming periodic boundary conditions. 
(See page 693 for a colored version of the figure.) 


E'O (Fa, B, à, p) = AWeona(F + (1 — A)pa @ b) 
(5.16) 
+(1 = A)Wcona (F 7 Apa ® b) 


with the microscopic energy Weona(F’) defined in (5.15). Let q = (a, B, À, p) 
and introduce the feasible set 


D={qeR': a,8,p ER, »€(0, 1}, 


the first order laminate envelope is obtained by solving the following global 
optimization problem 


R Weona(F) = min EO (F; 4) , (5.17) 
qE 


under the constraints 
det(F +(1—A)pa®@b)>0, det(F — Apa 8b) > 0. (5.18) 


Following the definitions in (3.15) corresponding minimization problems 
can be set up for higher order laminates. The growing number of optimization 
variables, however, strongly limits a practical application. Already for low 
order laminates the numerical search for the minimizer of (5.17) turns out 
to be difficult, because the objective function may present an exponential 
number of nearby optimal local minima [Car01]. 

Within the techniques of global optimization for the solution of (5.17), 
probabilistic global search procedures are the one commonly adopted. Apply- 
ing a local search several times starting from randomly chosen sampling points 
leads, however, to an inefficient global search, because the same local mini- 
mum may be identified over and over. As an improvement, clustering methods 
attempt to avoid this inefficiency by carefully selecting points at which the 
local search is initiated. 
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Algorithm 5.1 (Clustering method) 

Input F, initial population q; E X of n starting points, tolerance €. 

(a) (Sampling and reduction): Sample the objective function E' at qi and 
reduce population taking the m best points giving the least value. 

(b) (Clustering): Identify clusters, such that the points inside a cluster are 
‘close’ to each other, and the clusters are ‘separated’ from each other. 

(c) (Center of attraction): Identify a center of attraction in each cluster. 

(d) (Local search): Start a local search from the center of attraction and stop 
when a minimum is reached within the tolerance e. 


Output the value of RY Weond(F). 


The final local search step is done by using sequential quadratic program- 
ming methods with simple bounds [NW99]. Since in a finite element frame- 
work the above algorithm has to be performed at every material point (e.g. 
Gauss point), for real applications it is important to develop fast techniques 
for the numerical relaxation. In the literature the computational effort re- 
lated to the global search is usually reduced by fixing some laminate related 
parameters on the basis of conjectures motivated by physical considerations 
[ORS00, AFO03, ML03, MLG04]. 


Mixed analytical-numerical relaxation 


A different approach to the relaxation of Weong over laminates has been pur- 
sued in [CCO06]. Rather than attacking the global minimization by a brute- 
force global optimization algorithm that is anyway computationally very ex- 
pensive, [CCO06] exploit the structure of the problem both to achieve a fun- 
damental understanding on the optimal microstructure and, in parallel, to 
design an efficient numerical relaxation scheme. Inspired by results based on 
the global optimization [BCHH04] and on analytical relaxation in the case 
of rigid elasticity and no self-hardening [CT05], we determine analytically a 
second order laminate which has ‘good’ energy and furnishes an upper bound 
to the relaxed energy. 

We consider first an elastically rigid problem where the elastic part of the 
deformation is assumed to be a rotation, and only the contribution from the 
plastic free energy is considered, i.e., dissipation is neglected. The condensed 
energy for this case is then given by 


La j = SQM 
waf? fF=QI+ys8m) Qe sSO(2), Bio 


œo else, 





with the quasiconvex envelope obtained as follows 


Theorem 5.1 ([Con03, Con05]). The quasiconvez, rank-one convex, and 
polyconvez envelope of W'(F) are equal and given by 


(5.20) 


oo else. 


L (Fm? — 1) if det F = 1 and |Fs| <1, 
W,(F) = 
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Fig. 5.2. (a) Bounds to the quasiconvex envelope of the condensed energy for zero 
dissipation; (b) Polyconvex and second-order laminate envelope for the condensed 
energy density in single-slip plasticity. 


The optimal energy is given by a first-order laminate, which is supported on 
two matrices which have plastic deformation y of the same magnitude and 
opposite sign. 


We then construct a more refined model by assuming the microstructure to 
have the form of a laminate of second order, which is supported either on 
rigid-plastic deformations or on purely elastic ones. In this case, assuming 
volume-preserving deformations, the relaxation is reduced to a global mini- 
mization of a function of only one variable which defines the orientation of 
the laminate. Using this solution and the splitting of F, from Thm. 5.1, we 
obtain an approximate second order laminate. The latter is then used as a 
starting point for the local minimization of the full energy density, including 
dissipation, and removing the kinematic constraint. 

Figure 5.2(a) depicts the condensed energy Weona (see eq. (5.15)) together 
with We (see eq (5.12)), W,, (see eq (5.20)) and the value of the energy over 
the approximate second order laminate (which we denote by f**(0)) for the 
case of a pure shear strain F = I + €r @ r+ with r = (1,0), r+ = (0,1) and 
for the material constants u = 1.0-10*MPa, « = 1.5 - 104 MPa, h = 1.0- 102 
MPa and Ter = 10 MPa. 

Figure 5.2(b) shows a very good quantitative agreement for the values of 
the relaxed energy with those in [BCHH04] which had required a significantly 
higher numerical effort and compares approximations of the polyconvex hull 
WP (F), realized with the procedure described in Sect. 4.1. A finer analysis at 
small deformations reveals however some differences, which will be discussed 
elsewhere [CCOO6]. 

Figure 5.3 depicts finally the value of the volume fractions A and Aj 
whereas ào = 1. Initially, the material is in a homogeneous elastic state. Then 
an elastic state and a mixture of two opposite-slip plastic states appears. The 
volume fraction of the elastic phase starts at 100% and then decreases contin- 
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Fig. 5.3. Volume fractions À and A for different values of € 


uously until it vanishes at a shear € = 0.13. Both plastic phases then progress 
with slowly varying volume fractions until the homogeneous phase F is stable. 


6 Conclusions 


In this paper we have considered the numerical analysis of relaxed formula- 
tions for variational formulations lacking lower semicontinuity, and discussed 
algorithms for the approximation of the quasiconvex envelope of energy den- 
sities, in cases where it is not known in closed form. Relaxed solutions convey 
important information on the macroscopic behaviour of the microstructure, 
summarized by the relaxation theory. A resulting benefit is that the approxi- 
mation of macroscopic quantities does not pose severe difficulties and classical 
algorithms for numerical optimization can be efficiently employed. 
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Summary. We discuss the derivation of two-dimensional models for thin elastic 
sheets as T -limits of three-dimensional nonlinear elasticity. We briefly review recent 
results and present an extension of the derivation of a membrane theory, first ob- 
tained by LeDret and Raoult in 1993, to the case of incompressible materials. The 
main difficulty is the construction of a recovery sequence which satisfies pointwise 
the nonlinear constraint of incompressibility. 


1 Introduction 


The formulation and the study of reduced theories for the elastic properties of 
thin films is a traditional field of (heuristic) multiscale analysis, dating back to 
Euler. A rigorous derivation of such models from three-dimensional elasticity 
theory is instead a recent development, and is based on the mathematical tools 
of ’-convergence, which permit passage to the limit for variational functionals. 

The starting point is three-dimensional nonlinear elasticity. Precisely, let 
the cylindrical domain N, = w x (—h/2,h/2) represent the reference configu- 
ration of the sheet, where w C R? is the cross-section and h the (small) height. 
The deformation field u : Rp — R is determined by minimizing the elastic 
energy, which after scaling with the film thickness h takes the form 


Erlu, f] = F (W3p(Vu) — fn- u) dz, (1.1) 


where fa is a given vector field representing the applied forces. The deforma- 
tion u can be additionally subject to boundary conditions on all or part of 
the lateral boundary (ðw) x (—h/2, h/2). Different scalings of fa with h lead 
to different scalings of the optimal energy, and correspondingly to different 
limiting functionals. In the relevant cases the term fn -u turns out to be a 
continuous perturbation of the term W3p(Vu). 
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The history of the theory of thin sheets and shells is paved with a number 
of simplifications of the general elasticity functional (1.1) aiming at a direct 
description of the behavior of the sheet as a two-dimensional object, without 
resolving explicitly the structure in the third dimension. Different assump- 
tions on the energy density W3p and on the scaling of the forces (and hence 
of the energy) lead to drastically different limiting functionals. Mathemati- 
cally, for different energy scalings one deals with problems from relaxation, 
from variational convergence of singularly perturbed problems, and from the 
geometry of isometric immersions. 

A number of heuristic derivations led to the formulation of the most rel- 
evant limiting theories in the field; the study was then made systematic by 
means of asymptotic expansions, which rely on a-priori assumptions on the 
structure or at least on the regularity of the minimizers, see [Cia97]. Only 
recently it has become possible to derive in a rigorous way two-dimensional 
reduced models from the variational problem of elasticity (1.1), by using the 
concept of -convergence, as introduced by De Giorgi and his school [DGF75], 
see also [Dal93, Bra02]. We focus here on the derivation of a nonlinear mem- 
brane theory for incompressible materials, and generalize the result obtained 
for finite-valued energy densities by LeDret and Raoult [LR93, LR95] to the 
case where the nonlinear unit-determinant constraint is imposed. Essentially 
the same result was independently obtained by Trabelsi [Tra05, Tra06]. 


2 Rigorous derivation of elastic theories for thin films 


2.1 General framework 


I’-convergence corresponds to convergence of energy along minimizing se- 
quences for a family of functionals and all continuous perturbations [DGF75, 
Dal93, Bra02]. Precisely, a family of functionals Fe : X — RU {oo} T- 
converges to Fo : X — RU {oo} if the following properties hold: 


(i) For every pair of sequences £j — 0, uj — u, one has 


Fou] < lim inf Fe, [u,]. 

j= 

(ii) For every u € X and every sequence ¢; — 0 there is a sequence uj — u 
with 
Foļu] = lim Fe [uz]. 

jroo 
The notion of I’-convergence depends on the topology of the space X (which 
in this discussion is assumed to be metrizable). The most natural choices are 
those for which the family F is equicoercive, i.e., 


for any t € R the set fu € X : F.[u] < t} is precompact. 
E 
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Indeed, if I — lim Fs = Fo and the F, are equicoercive, then the set of min- 
imizers of Fo coincides with the set of accumulation points of minimizing 
sequences of the family Fe. We also recall that functionals defined on sub- 
spaces Y C X can be trivially extended setting them to be co on X \ Y. 

Appropriate notions for the convergence of a family of deformations un : 
Qn, — R? to a limit v : w — R? are best defined after rescaling to a single, 
fixed domain. We define the rescaling operator Tp by 


1 1 
(Tru) (x, 73) = u(x, haz) for x’ € w and 73 € (-5 5) (2.1) 


For the limiting map v : w > R? the rescaling reduces to (Tov) (x', x3) = 
v(x’). The relevant convergences are weak and strong convergence in 
W??(Q,;IR°)/R°. The quotient means that an additive constant has been 
factored out, i.e., we say that Trun —> Tov in W1?P(1; R3)/R? if there are 
bn € R3 such that Thun — bn —> Tov in W1 (weakly or strongly, respectively). 

Given a Borel measurable function W3p : M3*3 — [0, oo], we shall consider 
the family of functionals I, : W1? (2p; R?) — [0,00] defined by 


I, [u] = 7) W3p(Vu) da. (2.2) 


2.2 Smooth energy densities 


I’-convergence of the functionals J, to a membrane theory was obtained by 
LeDret and Raoult in 1993-1995 [LR93, LR95], for energy densities with p 
growth (see (2.6)). The limit functional J : W!?(w; R3) — [0, 00) is 


J[u] = / W355 (Vu) da’ . (2.3) 


The energy density W,}5 is obtained by first optimizing over all possible di- 
rections of the normal gradient b, i.e., setting 


Wap (F") = inf {Wsp(F’|b) : b € R?} , (2.4) 


and then replacing W2p by its quasiconvex envelope [Dac89, Miil99], 


Woe y= ard Wop (F’ + Vy) da’ : y € Wy’ (0, ve} . (2.5) 


(0,1)? 


Theorem 2.1 (From [LR93, LR95]). Let w be a bounded and connected 
Lipschitz domain in R?, and suppose that Wsp : M°*? — R satisfies 


1 
=|F|P — c < Wp(F) < c| FP +c, for some p E€ (1,0). (2.6) 
c 


Then the sequence In o T; ' is weak-W1? (R1; R?)/R? equicoercive, and 
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with respect to the weak-W +? (91; R3) topology. 


The functional J o J,’ is defined by (2.3) on ZoW1P (w; R3), and extended 
by oo to the rest of W!?(Q,;R?). By the compactness result, the statement 
is equivalent to I’-convergence with respect to the strong L! topology. 

If the energy tends to zero with h, one is interested in considering rescaled 
functionals, such as h~°I;, for a > 0. The limit can be finite only on the null 
set of J. To analyze this regime we assume that 


W3p(QF) = W3p(F) for all Q € SO(3) and all F € M?*’, (2.7) 
Wsp(Id)=0,  Wsp(F) > Cdist?(F,SO(3)), and (2.8) 

W3p is C? smooth in a neighborhood of Id. (2.9) 

Then J is minimized by short maps, i.e., maps v : w — R? such that |v(2’) — 


u(y’)| < |x’ — y'| whenever the segment [2’, y’] is contained in w. 


Theorem 2.2 (From [CM05]). Let w be a bounded Lipschitz domain in R?, 
a € (0,5/3), and let W3p obey (2.7-2.9). Then 


0 if v is short, 


œo else, 


dee. a z = 
P= lim FyIn oT, = oT, nisi={ 


with respect to the weak W1? topology. 


A different behavior is instead obtained if Dirichlet boundary conditions are 
imposed, both in the compressive case [BC*00, JSO1, BC*02, JS02, Con03}] 
and applying external forces to a membrane whose boundary is kept fixed 
[CMM06]. The latter work has in particular permitted to derive by T- 
convergence a relaxed version of the classical membrane theory by Foppl. 

For a = 2 one obtains a plate theory, which describes the bending- 
dominated regime. A rigorous derivation, confirming the expression first ob- 
tained by Kirchhoff in 1850, was recently obtained by Friesecke, James and 
Müller [FJM02] (see also [Pan03}). 


Theorem 2.3 (From [FJM02]). Let w be a bounded Lipschitz domain in 
R?, and suppose that W3p satisfies (2.7-2.9). Then the sequence I), o Gia" is 
strong-W1+:?(Q;IR°)/R® equicoercive, and with respect to the same topology 


1 
r— lim jain (0) T = Jplate (0) To” $ 
The limit Jplate is defined by 


1 

zS Qe(II,)dz" ifv € W>? (w; R3), and Vu € O(2,3) a.e., 
Jplate|U] = 24 w 

oo 


else, 
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IL, = (Vv)? V (ôv A Gav) and 


ew. 
Qo(F") = inf {Q3(F" +d@e3+e3@d):deR*}, Q3= n (Id). 





2.3 Incompressible materials 


The foregoing results assume that the free energy density is finite (or even 
smooth) on an open set in the space of deformation gradients. This as- 
sumption excludes the important class of rubber-like materials for which 
the energy density incorporates an incompressibility constraint, in the sense 
that the energy is infinite for all deformation gradients F that do not sat- 
isfy the nonlinear condition det F = 1. This situation arises, for example, 
in the analysis of soft elasticity in thin sheets of liquid crystal elastomers 
[WT96, DD02, CDD02, WT03, ACDO6]. 

We focus here on the derivation of a nonlinear membrane theory for in- 
compressible materials. Theorem 2.1 concerns finite-valued energy densities; 
it was later extended by Ben Belgacem to energy densities which are infinite 
if the determinant of the deformation gradient is negative [Bel96b, Bel96a]. 
We characterize here the I’-limit for incompressible materials and prove that 
the mechanism by which the limiting energy density is obtained is similar. 
It is a remarkable feature that the incompressibility constraint is lost in the 
I-limit. Essentially the same result was independently obtained by Trabelsi 
[Tra05, Tra06]. We give here a full proof, including a discussion of the relevant 
ideas from differential geometry. At variance with [Tra05, Tra06], we do not 
need to take the rank-one convex envelope of W2p first. 


Theorem 2.4. Let w be a bounded and connected Lipschitz domain in R?, 
and suppose that W3p : M?” > RU {oo} has the form 


Wo(F if det F = 1, 
Wsp(F) = se a i (2.10) 
oe) else, 
where Wo : M8*3 — R is continuous, nonnegative, and satisfies 
1 
=|F|P — c < W(F) < c| FP +c, for some p € (1,0). (2.11) 
c 


Then the sequence In o T; ' is weak-W1? (R1; R?)/R? equicoercive, and 
r- lim Ino Tp = J o Ty 
with respect to the weak-W1:?(Q,;R°) topology. Equivalently: 


(i) (Compactness) For every sequence hj — 0, and every sequence uj : Qn, > 
R? such that In luj] < C < œ, there exists av € W™P(w; R?) and a 
subsequence such that 


1 
Th, Uj, — Teh f Thi, uj, dx dz3 + Tov in WP (R; R°). 
© 
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(ii) (Lower bound) If additionally Tn uj — Tov in W™P(R1; R3), then 


liminf Jp, [uj] > J[o]. 
J> O0 
(iii) (Upper bound) For any u € WP (w; R?) and any sequence hj — 0 there 
is a sequence uj E C®(Nn,; R°) such that T,,u; > Tov in WHP(Q1; R°) 
and 
lim sup Jn, [uj] < J[v]. 
T= 

The proof of this theorem involves two main parts: (i) the verification 
of the compactness statement together with the liminf inequality, and (ii) 
the construction of a recovery sequence. The definition of the limiting energy 
Wp is based on the notion of quasiconvexity in the sense of Morrey, which 
naturally relates the lower semicontinuity arguments needed in part (i) with 
the explicit constructions needed in part (ii). 

The main difficulty is here the construction of the recovery sequence for 
the upper bound. This requires several steps that are carried out in Sect. 4—6. 
The naive idea is to try to extend a given function u : w — R3 in such a way 
that the gradient of the extension up is approximately 


F(a! 23) = (ðrula'), dula’) Orti A Gau )) 


aT Gee Ge 
ðu TAN zu]? 


This expression satisfies the condition det F(x, x3) = 1, however F is not, 
in general, a gradient field. This construction method can in fact be made 
precise if |O,u A ðzu| # 0 for all x in w, see Sect. 5 and 6 for details. One 
crucial step in the construction of the recovery sequence is therefore to show 
the existence of a small perturbation of u in WŁ? with the property that the 
wedge product does not vanish in w. This is also a crucial step in the proof 
of Whitney’s embedding theorem [Whi43, Whi44a, Whi44b], where Whitney 
proves the existence of an approximation in C°. Indeed, Whitney’s arguments 
can also be used in the situation at hand, and permit to obtain the desired 
WP approximation as well. We summarize the main steps of his argument 
in Sect. 4, for the special case of interest here, focussing on the few additional 
estimates and corrections needed for this application. Whitney’s construction 
does not, however, provide a good energy estimate. In particular, in order to 
control the energy of the modified function we need a good estimate for the 
LP norm of the inverse of the wedge product, i.e., for 1/|31u A 02u|. This is 
achieved by means of local perturbations with smooth oscillations in the spirit 
of the work of Nash and Kuiper [Nas54, Kui55]. 

Also a plate theory can be obtained for incompressible materials. The 
following result was announced in [CD05], its proof will appear elsewhere. 


Theorem 2.5. Let w be a bounded, convex Lipschitz domain in R?, let W3p 
be as in (2.10), and Wo obey (2.11). Let QiP° = 0?Wo/OF?(Id), and 
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Qe (F") = inf {Q3(F” +d® e3 +e3 @d): dE R*, Tr(Gld) = 0} . 


Then all assertions of Theorem 2.8 hold, with Jplate replaced by 


1 ; 
=| Qe °(II,) da’ ifv € W2?(w;R), and Vu € O(2,3) a.e., 


patel?) = 


oo else. 


3 Incompressible membranes: lower bound 


We now start the proof of the equicoercivity and the lower bound. For sim- 
plicity we drop in the following the prime on 3 x 2 matrices. We first observe 
that due to the continuity and coercivity of Wo, (2.4) reduces to 


(3.1) 


oo else. 


min W(F|b) if rank F = 2, 
Wop ( ) = bER3 


The growth condition (2.11) implies, for all F € M?*? with rank F = 2, 


1 1 


cIRA RP Os Walk) SAF +e 


1 
FP + (3.2) 


1 
[AARP 
We first study some properties of the relaxed energy W5,. It is convenient 
to work with piecewise affine and continuous functions. We denote by P! the 
space of Lipschitz functions whose gradient is constant on each of at most 
countably many disjoint open triangles Tj, whose union is, up to a null set, 
equal to w. The properties of W3p listed in the next lemma follow immediately 
using the results and the techniques of [BM84, Fon88, Dac89]. 


Lemma 3.1. The function W3p has the following properties: 


(i) the definition does not depend on the choice of the domain w, as long as 
it is open, bounded, and |Ow| = 0; 
(ii) for any 5 > 0 there is p € P! N Wo’? (w; R3) such that 


1 
if, Wal + Velde < Wip(F) +6 and J reese: 


(iii) there is a constant c such that 
1 
Linh -e< Wip(P) < FP +e; 


(iv) W3p is quasiconvex and Lipschitz continuous on bounded sets. 


We are now ready to prove the compactness and the lower bound. This 
part of our result follows from the characterization given above and the results 
by LeDret and Raoult. For completeness we give a short self-contained proof. 
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Proof of Theorem 2.4 (i) and (ii). We work with the rescaled functions U; = 
T,,uj. By the growth condition (2.11) we have 


1 
f, |O,U;|? + |02U;|? + pp losusl? dz dz3 < c. (3.3) 
J 


After subtracting the average value, the sequence U; is uniformly bounded 
in W?(Q;IR3). We may choose a subsequence (not relabeled) that converges 
weakly in W!?. Further, 03U; — 0 strongly in L? by (3.3), hence the limit 
does not depend on x3. This concludes the proof of the compactness part. 
The lower bound is proved by lower semicontinuity (see [LR95]). Con- 
sider the function Z : M3*3 — R defined by Z(F) = W% (F) where 
F = (F,|F2). By Lemma 3.1 this function is ee continuous, and 
has p-growth from above. Therefore the functional Tlu =i o Z(Vu) dx dz3 
is lower semicontinous with respect to weak Wl? Eee o Since 


W(F) > Wa (F) > Wà (F) = Z(F) we obtain 


1 
imine f W (avul 2st) dx dz3 
2 j 


IMC 
= | Z (O:V|O2V) dx dx3 = J WD (ðiv|ðzv) dz, 
2 w 


with V = Tov. This concludes the proof of the liminf inequality. 


4 Approximation with regular maps 


The key ingredient in the construction of a recovery sequence is an approxima- 
tion result for functions in the Sobolev space W +? (w; R?) by smooth and regu- 
lar functions, i.e., by smooth functions u such that rank(Vu) = 2 everywhere, 
as discussed above. This approximation theorem follows essentially from Whit- 
ney’s [Whi43, Whi44a, Whi44b] work on the nature of singularities of maps 
u : R” — R?"-!. Moreover, it is crucial to ensure the lower bound (4.2) 
below. We shall obtain it by using methods developed by Nash [Nas54] and 
Kuiper [Kui55]. For the convenience of the reader we sketch below the con- 
struction for the two-dimensional case of interest here. 

The statement in the following proposition shows (i) that any smooth map 
u: w — R? can be uniformly approximated by regular maps, i.e., maps which 
fulfill rank Vu = 2 everywhere; (ii) that if the map u is regular away from 
a connected set l C w containing Ow, then the regular maps can be taken 
to differ from u only in a neighbourhood of I’, and (iii) that in a (smaller) 
neighbourhood of I’ the stronger condition |O,u A Q2u| > c can be enforced, 
for some universal constant c (i.e. the approximating maps are order one away 
from being degenerate). 
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Fig. 4.1. The fundamental singularity of mappings from R? into R: “Whitney’s 
umbrella”, given by the equations in (4.4) (up to a rigid rotation). 


Proposition 4.1. Let w C R? be a Lipschitz domain, I C @ a connected set 
which contains Ow, and define for n > 0 the set I, = {x € ù : dist(a, I) < n}. 
If u € C (©; R?) and rank Vu = 2 on w \ Ty, then for any 5 > 0 there exists 
a w E C™(H;R*) such that ||u — wl|cow) < ô, 


|Vw| <c(|Vu] +1), [ðw A ðzw| > côu Azu) onw (4.1) 
and 
w= u on ù \ Ion, [ðw A 2w] > c on Th. (4.2) 
All constants are absolute constants. 
Proof. We split the argument into several steps in which we use small pertur- 
bations to show that we can accomplish the following goals: 1. The rank of Vu 
is greater than or equal to one everywhere. 2. The rank of Vu is equal to two 
everywhere except at finitely many points which correspond to the canonical 
singularity of mappings from R? into R3, called “Whitney umbrella”. This 


means that for each p € w where Vu does not have full rank, there is a 
direction e = e(p) € S! with 


Veu=0, span (Veru, V2,.u, Veu) = R? (4.3) 


(here Veu = Vu - e, and so on). Equivalently, u can be locally represented, 
after a smooth change of coordinates in the domain and in the target, as 


(£1, £2) > (x7, £2, £122) (4.4) 


(the equivalence is shown in [Whi43, Th. 1]). 3. The rank of Vu is equal to 
two everywhere. 4. The lower bound on the angle between 0,u and zu (see 
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(4.2)) holds. The Steps 1.-3. follow Whitney’s original work, and the final step 
is based on ideas of Nash and Kuiper. 


Step 1: For all 6 > 0 there exists a smooth function us such that u = us on 
w\ Ton, |Vus— Vu] < ô, |O1us Adgus| > $|O:uAd2u| on Tən \ Th and gus + 0 
everywhere. 


Let Yy be a smooth cutoff function supported in I, and such that wy, = 1 
on I). We define, for a € RÌ, the function u® (x) = u(x) + vy (a)a2 a. For a 
sufficiently small we get |V(¢,(x)az2)| < 6. We assert that we can choose a 
arbitrarily small so that uS (x) # 0 everywhere, and rank Vul®) = 2 outside 


I,. Consider first points in I}, where ul?) =u2t+a. The set u.2(@) C RÌ has 


dimension two, hence there are arbitrarily small a such that —a ¢ u.2(@), or 
equivalently u,2(x) + a # 0 for all x € w. Now consider points in the compact 
set Ton \ Ty = Tan \ {a € R? : dist(x, T) < n}. Here 


Vu (x) = Vu(z) + Y,(z)a Q e2 +2208 Vy, 


hence the distance to Vu is controlled by a times constants depending on 
I and 7. Since rank Vu = 2 on Igy \ I}, the function |O,u A O2u| has a 
positive minimum on the same set, hence for |a| sufficiently small we have 
[3u A dul| > $lðu A zuj. Finally, outside of I>, the two functions 
coincide, hence us = ul® has the desired properties for |a| small enough. For 
simplicity we write in the following u for us. 


Step 2: Suppose that u is a function with the properties established in Step 1. 
Then for all 6 > 0 there exists a smooth function us such that u = us on 
w\ Ton; |Vus — Vul < ô on w, |Orus A O2us| > tau A O2u| on Tən \ Th, and 
rank(Vus) = 2 except at finitely many points in T, which are of the Whitney 
umbrella type (4.3). 


For 3,7 € M?*? and Yy as in Step 1, let ug (x) = u(x) + Wy, (x) [6x + ziya]. 
The fact that xı appears explicitly in the perturbation is related to the choice 
of having u.2 # 0 in Step 1. As above, the function u has been modified only 
inside Ion. For sufficiently small 8 and y the gradient of the correction is 
uniformly small, hence the bounds for |Vus — Vu| and |O,us A Q2us| follow. At 
the same time the set of (3, y) € R' for which ug has (in T4) singular points 
not of the type (4.3) has dimension at most 11 [Whi43, Th. 2]. Therefore one 
can choose arbitrarily small @ and y so that all singular points of ug, are of 
that class. Finally, singularities of the type (4.3) are necessarily isolated, and 
therefore ug, can have at most finitely many singularities in the compact set 
©. As before, we are going to call ug again u. 


Step 3: Suppose that u is a function with the properties established in Step 2. 
Then for all 6 > 0 there exists a smooth function us such that u = us on 
w\ Ton, |Vus| < C(|Vul+1) and |ju—us| < 6 on ù, |OiusAdgus| > 4lðruA^ ðu] 
on Ion \ Th, and rank(Vus) = 2 everywhere. 
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Fig. 4.2. (a) Modification of the map in Figure 4.1 in order to remove the singularity. 
(b) Plot of one half of surface in (a). This view illustrates that the modification is a 
smooth surface in R for which the tangent plane in each point has full rank. (See 
page 693 for a colored version of the figure.) 


This step cannot be accomplished with purely local perturbations. Indeed, 
Whitney derived a sum rule which relates the total charge of the singularities 
to the behavior of the map at the boundary [Whi44b]. This implies that they 
cannot be eliminated without modifying the boundary values. The proof of 
Whitney’s embedding builds upon the fact that singularities can be moved to 
the boundary by means of C® perturbations of the given map (this is, however, 
in general impossible with C! perturbations). We sketch the construction and 
show in particular that one does not need to modify the map outside I, and 
that the stated bounds can be satisfied. 

Let p1, ..., pk E I, be the finitely many points where Vu has rank one. We 
first assert that there are smooth arcs A; of finite length in I, which are closed, 
disjoint, and such that A; starts in 2; and ends on Ow. To construct these arcs, 
we first choose k distinct points q1,...,q% E€ Ow. Each p; can be connected to 
qi by an arc of finite length contained in I7,. Indeed, T is connected and can 
be covered by finitely many balls of radius 7; we can therefore choose an arc 
that has length at most 37 inside each of the balls and therefore finite total 
length. By a standard perturbation argument the arcs can be modified (the 
pairings of the points p; and q; may also change) so that the new arcs are 
disjoint, and the assertion is proven. It follows that the distance dist(A;, A;), 
i Æ j, is bounded from below by a constant € > 0, and we can also assume 
E<N. 

In a neighbourhood of each x; the map u is, in suitable coordinates, of 
the form (4.3) (see Fig. 4.1). We modify the surface as illustrated in Fig. 4.2, 
inserting a thin cylinder which (in the original coordinates) follows the path 
A; from the singularity to the boundary of w. If each of these corrections is 
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supported on an ¢/3-neighbourhood of the arc A; then the ones constructed 
for different singularities do not interfere, and do not change u outside of Ion. 
Notice that this construction may generate large curvature in the cylinders, 
but the increase in C! norm is bounded. From now on we denote by u the 
function obtained after these modifications. 


Step 4: Suppose that u is a function with the properties established in Step 3. 
For any 6 > 0, there is a smooth function w such that 


1 1 
[ðw A w| > z” Ia, [w A d2w| > glu ^ ô2u| on To \ In 


with w = u on w \ Ton; |Vwl < C(|Vu| +1) and |w — u| <6 on ù. 


In this step we need to modify u so that |w,1 \w,2| has a lower bound of order 
one. We use hereafter for brevity the notation w; = 0;w. Our approach follows 
that used by Nash and Kuiper [Nas54, Kui55], with the difference that here 
we only need two iteration steps. The basic idea is to insert locally oscillations 
to increase the appropriate components of the gradient. 

We first sketch the construction for the simpler case of one spatial di- 
mension. The corresponding statement is that any smooth parameterized 
curve y : [0,1] — R? with y~’ 4 0 can be modified so that |y'| > 1. The 
key idea is to insert fine-scale oscillations normal to y’, and to multiply 
them by a cutoff function to localize the correction. Precisely, one writes 
elt) = y(t) + ev (t)P(t/e), with @ a periodic oscillating vector field, and a a 
smooth cutoff function, which is equal to one on the regions where ¥’ is small. 
Taking the derivative one sees that the term w@’ is of order one, whereas ey’ B 
is of order £, hence negligible for small e. 

Assume now that y parameterizes a line segment, y(t) = (y1(t),0), with 
yi > 0. This is the generic case, if one works in a coordinate system that 
moves with the curve (here one needs y’ # 0, and additional smoothness of 
y). Since the normal component 2 oscillates on a scale £, its derivative is 
often zero. Hence a uniform control on |74| from below cannot be gained via 
the normal derivative alone. We therefore seek a bound from below for the 
other component, |y, + w&{|. In the region where yj is large, y = 0 and 
there is no correction. In the region where y; is small, Y = 1 and the term 
yw, dominates. Hence there are points where the two terms in |y + ~®{| are 
equal, and one needs to ensure that no cancellation can occur. 

The argument above illustrates that one cannot use simple oscillations, 
like S(t) = (sin t, cost), because the map t — P'(t) would move around the 
unit circle without any sign condition. One resorts instead to a more elabo- 
rated oscillation pattern, where the map t — P'(t) does not move around a 
circle, but rather oscillates forward and backward on an arc of a circle, with 
non-uniform speed so that its average vanishes (and the primitive ® can be 
periodic). The precise definition of y = @ is given in (4.5) below. 

We now give the proof for the two-dimensional case. Our starting point is 
the map u € C% (w; R?) constructed in Step 3. Since ui ^u,2 4 0 everywhere, 
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Fig. 4.3. The curves yi(t) (full curve) and y(t) (dashed curve) given in (4.5), 
as functions of t. Note that pı becomes negative only in a region where p2 is well 
separated from zero. The arc covered by the point (¥1, ¢2) is illustrated in Figure 4.4. 


there exists a p > 0 such that |u, A u,2| > p > 0 everywhere, and the normal 
vector V = ui Au2/|ui A up2| is smooth on ©. Let py be again a smooth, 
nonnegative cutoff function such that Yy = 1 on I, and Yy = 0 outside Ion. 
We set, for € > 0 to be chosen later, 


v(x) = u(x) + ew, (x) |, (=) p+ Bo (=) v| i 


Here u = ui/|u| and 1,2 are 27-periodic primitives of 


cos(a sint sin(asint 
spo g ee (4.5) 
sin Q sin Q 


with ©, (0) = 0. The number a € (0,7) is chosen so that yı has average 
zero over its period (0,27) (the existence of a follows from the mean-value 
theorem; numerically, a ~ 2.4, see also Fig. 4.3). We find that 


Vu = Vu + (onpit Ynev) 8 e1 + Ole). (4.6) 


Here and below, we denote by O(¢) quantities which are uniformly bounded 

by Cujn€, where we write Cy, for generic constants which can depend on u 

(specifically, on ||u||c2 and min |u, Au.2|), on 7, on I’ and on w, but not one. 
Consider now the derivative vı. Up to O(e) terms, it coincides with 


2(a,t) = (Ju 1|(2) + dy (@) yr (t)) u(x) + Yn(@) p2(t)v (x) (4.7) 
evaluated at t = zı /¢. Considering z as a function of t for fixed x, we see that 


|z| > max(,, |u,1| sin a) (4.8) 
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Fig. 4.4. For fixed x, the values of t — z(x,t) (see (4.7)) are located on an arc 
of a circle in the p, v plane. The radius is w/sina, the center C = (m,0), where 
m = |u| > 0. The closest point to the origin is A = (m — y cot a, Y) (since two 
distinct circles intersect in at most two points), and it corresponds to t = 7/2. Its 
distance |A| from the origin O = (0,0) is greater than or equal to the distance d of 
O to the orthogonal projection of O onto the line through A and C. This distance 
is given by d = msin(a — a) = msina. At the same time, |A| > |A2| = 4%. 


(see Fig. 4.4). In particular, in the region where Yy = 1 (covering Ty) [v| > 
1+ O(e), whereas on the rest |v,1| > (sina)|u1|+ O(e). We finally verify that 
the wedge product v, A v,2 does not vanish (the uniform lower bound on its 
length will be obtained only after superposition of additional oscillations in 
the next step). A direct computation shows that 


yp 
vi Av = u1 ^uo] (: + hea] v + |u lpp u + Ole) 
a 


where pi’ = vAUu.2/|u,2|. The two unit vectors v and py’ are orthogonal. Further, 
Jui A u2|/lu1| < |u|. Therefore 


(1 + ban v1) y+ Wa (pe u 
lu] |u,1| 


+ Oe). 


lui Avol > lua Aue 








Reasoning as in (4.7-4.8) we get 
[v1 Av | > ju, A uz] max (Eys a) + Ofe). 
u 


Now we choose £ > 0 (depending on Cu,ņn) so small that all the O(e) terms 
in the foregoing equations are less than 1/4 of the leading-order terms. The 
function v defined in this way belongs to C° (©) and satisfies ||v||c2 < Cu,n 
and |v 1 A v,2| > [u1 A u.2|/2. Moreover, |v 1| > 3/4 in I). 

We finally insert oscillations in the orthogonal direction by defining 


w(x) = v(x) + Ey, (2) [e (2) i+ Po (=) P| 


€ 
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Here ®; 9 are the same functions as above, and 


= viANve = yA VA yA VA 


y=—— H a 


a “Enol Pal 


Note that (v 1/|v,1|, 7, #2) form an orthonormal basis. We find 
Vu = Vv + vy (Yi + p20) Q e2 + OE), (4.9) 
and by the definition of the basis vectors, 
w, Aw, = v, A (v,2 + Yn (piñ + p20)) + OF) 


= v1 A (A: v2 + Yp) A t+ dyed) + OE) 
= (H - v,2 + Yng) v, AA+ Yng v1 AT + OCE). 


The vectors v, A jt and v, AT are orthogonal, both have length |v,1|, and 


M j= (TA v1) £ V2 = (vı A v2) -v = lua A v 2|? 7” lua A v al 
vg = oS Oo = M 
l lv,1| lv,1| [v1] lv, Av, lv,1| 


Reasoning as in (4.7-4.8) above (see also Fig. 4.4) we obtain 


A 
|w, A w,2| > |v] max (vn —— sin a) + O(é). 
UA 


We choose £ so that the last correction is smaller than 


1 1 
min | —, -min |v; Avol sina ]. 
(G o lua 2 ) 


On Ty we get |w A w,2| > |u| —1/6 > 1/2, and globally 
1 
wa Aw| > lui Avy2|sina + O(é) > gle Avg|sina. 


The estimate for |Vw| follows from (4.6) and (4.9) and concludes the proof. 


5 An extension subject to the incompressibility 
constraint 


The next important step in the construction of a recovery sequence is the 
extension of a regular map w : w —> R? to a map un : Q, —> R? with 
det Vu» = 1 everywhere. In view of the previous results we may assume that 
rank(Vw) = 2 everywhere. Heuristically, one can think of extending w by 
setting u(x, z3) = w(x) + x3v(x), where 

ðw TAN zw 


v = ————,; 5.1 
|O.w A dow]? ( ) 
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(v here does not represent a unit vector, but rather a scaled normal). In the 
case that w is itself affine, the deformation gradient turns out to be Vu = 
(Vw,v), which has determinant one. However, if the function w is not affine 
one obtains additional terms from the x, and zə derivatives of v. These will 
be compensated for via a nonlinear change of coordinates. 

Consider now the control of the energy. We focus here on the situation 
that Wop(Vw) is small, and seek u, so that W3p(Vup,) is correspondingly 
small. The additional step of passing from W3,, to Wap will be dealt with in 
Lemma 6.2 below. The definition of Wap in terms of W3p shows that, at least 
on large parts of the domain, the x3 derivative of up should not be v, but rather 
b as obtained from the minimization in (3.1). This b depends in a discontinuous 
way on Vw (generically), hence a suitable smoothing-interpolation scheme 
will be needed (see Proof of Theorem 2.4(iii) in Sect. 6). Therefore in the next 
proposition we assume only that v satisfies the constraint 


(ôw A Oow)-y=1 inw. (5.2) 


Proposition 5.1. Let w,v € C®(©;R?) satisfy (5.2) on w. Then there ex- 
ist an ho > 0 and an extension v € C®(© x (—ho, ho); R?) such that 
v(x,0) = w(x) and det Vu = 1 everywhere. Moreover, for all x3 E€ (—ho, ho) 
the pointwise bound 

|Vv —(Vwu|v)| < Cag (5.3) 


holds, where C can depend on w and v. 


Remark: If we define vp to be the restriction of v to Rn, for h < ho, then 
(5.3) implies that Trun —> Tow strongly in W!?(Q;R3) as h — 0. 


Proof. We define u € C® (Q; R?) by u(x, x3) = w(x) + 23v(x) and compute 
det Vu = det (Vw + 23Vv |v) = 1 + z3P(x) + 23Q(a) (5.4) 


where P and Q are combinations of the components of v, Vv, and Vw, which 
are all uniformly bounded in w. Hence for small x3 the determinant is close to 
one, and in particular | det Vu — 1| < Ca3 (C depends on w and v). We define 
v = uo®, with S(x, 23) = (x, p(x, x3)), and construct p so that det Vv = 
(det Vu o &) 03y = 1. Therefore y has to be a solution of the ODE 


1 


Dees) = Sue. Oey) 


subject to y(x,0) = 0. It follows from standard results in the theory of 
parameter-dependent ODEs that y is smooth, and the previous bound on 
det Vu implies that |y(a, x3) — x3| < Cx. Hence there is ho > 0 such that v 
is smooth on © x (—ho, ho), with det Vv = 1. 

The next step is the estimate for the gradient of v. By the chain rule, this 
requires a good control of the gradient of y. To this end we rewrite the ODE 


Thin Membranes with Incompressibility 241 


as an integral equation and differentiate. Let F be a primitive of det Vu with 
respect to 23, i.e. 


t 
Hedel det Vu(x, s) ds. 
0 


Then the ODE above is equivalent to the implicit equation F(x, (x, x3)) = 
x3. Differentiating with respect to xı we obtain 


plzz3) 9 Ə 
| — det Vu(z, s) ds + det Vu (x, p(x, x3)) — y(x, x3) = 0. 
0 Ox Ox 


The expansion in (5.4) shows that for 73 < ho the determinant det Vu takes 
values between 1/2 and 2. Taking the xı derivative of the same expression, 


we see that also 0; det Vu is bounded by C3 (C depends here on v and on 
the first and second gradients of w and v). It follows that 


(2) 
lAry| < zi |ð; det Vul ds < Cz. 
0 


The estimate for 02y is analogous. We finally observe that v(x, x3) = w(x) + 
p(x, %3)v(x), and hence 


Vv = (ðw + dåp v + pO V|Oqw + Oopy + y ôv|ðzp v). 


Since |0,y| + |O2p| < Cx3, and |d3y — 1| < Czx3, we obtain (5.3), with C 
depending on w and v. 


6 Construction of the recovery sequence 


For any function u € W1?(w;R?) we need to construct functions up € 
W?1?(Q);R°) such that Trun converges weakly in W1? to Jou and 


lim sup Ip [un] < J[ul. 
h-0 

In doing so we need to correctly account for the two relaxation steps, which 
lead from W3p to Wap and then to W3p. It is natural to proceed in the 
reverse order, and start by constructing a sequence uj such that the integral 
of Wop(Vu;) converges to the integral Wj) (Vu). We first reduce to the case 
of piecewise affine functions u, and then insert a suitable test function in each 
affine piece. 


Lemma 6.1. For any u € W1?(w;R?) there is a sequence uj € PŁN 
WEP(w; R3) such that uj > u strongly in W+? and 


lim J Wiv(vujae = f W5o(Vu) ar. 


j-oo 
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Proof. By density of P! O W}?(w;R°) in W1?(w;R°) there is a sequence 
uj € PŁ OW"? converging to u in W!?. The energy converges since W3, is 
continuous and has p-growth from above (Lemma 3.1, (iii) and (iv)). Indeed, 
it suffices to apply Fatou’s lemma first to W3)(Vu,) and then to C|Vu,|? — 
Wp (Vu;). The triangulation underlying uj is denoted by TO). 


Lemma 6.2. For any u € W1?(w; R3) there is a sequence of functions zj € 
Pt AW? ?(w;R3) such that zj + u weakly in W1? (w; R?) and 


limsup / Wop(Vz;) da < 1 Wp(Vu)dz. 
jnco Jw w 

Proof. Let uj be the sequence obtained in Lemma 6.1, and fix n; — 0. By 
definition, Vu; is constant in each of the at most countably many triangles 
TY ) We shall modify it separately in each of them. For any TY ) we can find 
by (ii) of Lemma 3.1 a function p e Pl o Wwa”( 9) such that 


1 


— | Wap(Vuj + Ve) dx < W3p(Vuj) +, 
IT| TO) 


and Jro Pp dz < mITP|. We define z; = uj + yp) in each TO, We have 


J Wao z)de < f Wolu) ae + nj and J l= lP dx < mlel; 


Using the coercivity of the energy, this implies that z; is uniformly bounded 
in W1?(w;R?). Since zj > u in LP, by the uniqueness of the weak limit, zj 
converges weakly in W1:?(w; R3) to u. 


In order to use the results from Sect. 4 we need to pass to smooth functions. 
This can be accomplished by convolution. However, we need to ensure that the 
smoothing does not modify the energy significantly. This is not entirely trivial, 
since we are now dealing with the non-convex, unbounded energy Wap. In this 
smoothing step we make again use of the fact that the sequence is composed 
of piecewise affine functions. Precisely, we show that the smoothing does not 
change zj away from the “jump set” I, which is defined as the union of 
the boundaries of the countably many triangles on which z; is affine. This 
argument is for a single z;, for uniformity we keep the index j. 


Lemma 6.3. Let zj € P N W!?(w;R°). Then for any n > 0 there exists 
ajn E C” (©; R?) such that 


jn =Z; onw\Ijn, and ui : (1+ |Va;n P) da =0. 
jn 


Here T; = ðw U U, aT), where BOM se are the triangles on which zj is 
affine, and 
Tym = {x € w : dist(x, Ij) <n}. 
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This implies immediately that aj, > zj in W'?(w;R*) as n > 0. 


Proof. It is east to see that 
ia \[jn|=0 and imi ie Vey Pde = 0. (6.1) 


To construct the functions aj, we choose an extension zj € W1?(R?;R°) 
(we use the same letter to denote this extension). We then mollify z; with a 
kernel supported on a ball of radius 1/2, and restrict the result to ©. Since zj 
is locally affine on w \ Ij, we have ajn = zj on w \ Ij,y. Further, 


| E E i: 
T; T 


jn j, 2n 


Vajas + f |Vz,|? dz, 

(3w)2ən 

where (Qw)o, = {x € R? : dist(x, ðw) < 2n}. Both terms tend to zero as 
n — 0, hence aj > zj in WHP, 


The following proposition summarizes what we have accomplished so far in 
this section, and incorporates the result of Sect. 4. 


Proposition 6.4. Let w be a bounded Lipschitz domain in R?, and suppose 
that W3p satisfies (2.10-2.11). Then for any u € W1? (w; R) there exists a se- 
quence of functions uj E€ C® (©; R?) such that uj + u weakly in WP (w; R3), 
rank Vu; = 2 everywhere, and 


limsup f Wan(Vuj)dx < | Wip(Vu) de. 


j—> o0 w 


Moreover, each of the functions uj is affine on finitely many triangles TY C 
w, n= 1,...,N(j), where we may choose N(j) such that the complement of 
these triangles is small in the sense that 


NG) 

limsup f (Wop(Vu;)+1)de=0 with wj =w \ U TO), (6.2) 
ITT OOT VR n=1 

Proof. Let zj be the sequence given in Lemma 6.2. For each fixed j, and each 
n > 0, let aj be as in Lemma 6.3. We shall now approximate the latter 
with regular functions. In order to do this, we apply Proposition 4.1 to the 
functions a;,,, and denote the result by 6,,, (if w is not connected, it suffices to 
treat each connected component separately). On I 2,, we estimate the energy 
by means of the upper bound in the growth condition (3.2), giving 


Wop(Vbjn) da < cf (IVb jn)? + +1) de. 
Tan 


Ton |01j,n A 02b5,|? 


The term |Vb,,,|? + 1 is by (4.1) bounded by a constant times |Vaj,,|? + 1. 
The second term in the integrand is bounded on I} by a universal constant 
by (4.2), and on I>, \ Ih, again in view of (4.1) and (3.2), by 
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1 c 
o ca A Wal Na; = cWop(Vz; À 
[ibin Azbi Otim A azn ~~ BANE) E YT 


where in the last step we used that Vaj = Vz; outside 1}. We obtain 


Wap(Vb;,,) da < cf (Wan (V z5) + |Va;,n P) dæ + cellon]. 


Ion Ton 


Both quantities converge to zero as 7 — 0. Since by construction bjn = aj,n 
on w \ Ibn, we conclude 


lim f Wap(Vbjn) de < / Wp (Yz) de. 
er w 


w 


Moreover, 
/ [Vb; n — Vz;|? da < cf (IVb; + |V zP) da < cf (IVz;|P +1) dx 
w Ian Don 


tends to zero as 7 — 0, hence bjn > zj strongly in W1? (w; R3). Taking a suit- 
able diagonal subsequence we can choose uj = bj.,(;) such that uj; converges 
weakly in W1? (w; R3) to u. This concludes the proof. 


Proof of Theorem 2.4 (iii). Given u € W1?(w;IR*) and a sequence hj — 0, 
we need to construct uj E€ C°(Q),,;R®) such that 


lim sup Jn, [uj] < J[u] 
jroo 
and Thuj — Tou weakly in W1?(Q;R*). By Proposition 6.4 we can find 
uj E€ C® (©; R?) such that uj — u in WHP, rank Vu; = 2 everywhere, and 


limsup | Wap(Vu;)de < | W5p(Vu) ae, (6.3) 


j—oo 


For each j there are finitely many disjoint triangles TË c w,n=1,...,N(j), 
such that uj is affine on each of these triangles, and such that wj = 
w \ UOT) is small, in the sense of (6.2). We now use Proposition 5.1 to 


construct the extensions to the sets w x (0, hj). For simplicity we drop in the 
following argument the index j from all quantities except uj. We first choose 
the field v, which will determine the derivative of u; in the x3 direction. We 
set (see (5.1)) vo = (ru; A Ogu;)/|O1u; A Ayu; |? and then modify vo inside 
each of the finitely many triangles on which u; is affine. Let Tn be one of 
them, Fp = Vu; on Ta, and bn € R? be such that 


W3sp (Fron) = Wop (Fr) = min Wsp (Fala) : 


By the growth condition |bn| < c(|Fn| + |vo| + 1), with c depending only on 
Wo. We define a smooth vector field v by 
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he volz) + Ynlx)(bn —vo(x)) ifaeT,, 
Vo(x) else. 


Here Yn E€ CSO (Ln; (0, 1]) is a cutoff function such that Yn = 1 on a subset T7 
of Tn of area at least (1 — 7)|T;,|. Since on Tn 


0= det(F'|b) = det(F|vo) = (Fy N Fə) ; (b = vo) 
(we dropped here the index n as well) we obtain 
(ruj A Oou;)-v = (Fi A Fa) : vo + Y (Fi A Fy): (b — vo) =1, 


ie. (5.2) is satisfied. Finally, consider the enlarged exceptional set po = 
w \ UTM. From the above estimates it follows that 


lim lim sp f À (Wop(Vu;) +1) dx =0. (6.4) 
ro 


Jao n—0 


We now apply Proposition 5.1 to the pair (u;,v), to obtain a function 
vj E C™(Q,,;R*), with det Vu; = 1 everywhere. By the uniform estimate 
(5.3) and the continuity of Wo we can estimate the energy, for h < hj, by 


1 
Inlv;] < a Wo(Vu;) dx dz3 < / Wo(Vu,|v) dz + Ch. 
Nn w 


The constant C can depend on uj and v. We focus on the first term. In most 
of the area, uj is affine and v equals the corresponding bn. On the remainder, 
the energy is small by the growth condition. Precisely, 


/ Wo(Vu,|v) dz < / Wo(Fnl|bn) dx + ef (|Vu5|? + |v]? +1) da 
w UTP dase 


< Wop(Vu;) dx + Os (Wap(Vu;) + 1) dz. 
w I} 
Therefore 


lim sup I;,[v;] < / Woep(Vu;) da + ef i (Wap(Vu;) +1) dz. 
h—0 w T 


By (6.3) and (6.4) we obtain 


lim sup lim sup lim sup J} [v;] < J[u] 
j>œ n>0 h=0 





hence taking a diagonal subsequence concludes the proof. 
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1 Introduction 


1.1 Multiscale problems in micromagnetics 


Ferromagnetic materials show a fascinating variety of magnetization patterns 
on scales ranging from a few nanometers to hundreds of microns. The for- 
mation and evolution of these patterns is at the heart of numerous magnetic 
devices, including the ubiquitous magnetic storage media. Somewhat surpris- 
ingly, this large variety of patterns can be understood as (local) minimizers 
of a simple, yet subtle, functional, the micromagnetic energy. Their dynam- 
ics is described by an associated evolution equation, the Landau-Lifshitz (or 
Landau-Lifschitz-Gilbert) equation, which combines Hamilitonian and dissi- 
pative aspects. 

Until recently the micromagnetic energy was mostly analyzed in one of 
two ways. The first approach is to consider special ansatz functions (inspired 
by physical intuition) with a few free parameters and then to optimize over 
these parameters. While this approach has lead to valuable insights, it is also 
limited in its scope. In particular one cannot detect something which has not 
been put in the ansatz. The second approach is large scale computation. This 
has been successful for answering specific questions for submicron devices. 
Due to the wide separation of the relevant scales, direct numerical simulation 
is, however, restricted to the smallest scales and cannot cover the full picture. 
Perhaps even more importantly, it answers specific questions, but provides 
little insight in general principles and understanding. 

In the last decade a new approach to micromagnetics has emerged, and the 
SPP 1095 has had an important impact in shaping it. This approach is based 
on two ideas. First, considerable insight can be gained by the identification of 
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optimal scaling laws involving the natural parameters, such as material con- 
stants or geometric quantities, and the corresponding magnetization patterns. 
This amounts to establishing upper and lower bounds on the micromagnetic 
energy. While for the former one can often rely on intuition and test func- 
tions developed in the physics community, the latter requires in general new 
mathematical ideas. The second approach is to derive simplified theories in 
certain limiting parameter regimes, e.g., for thin films. These theories reduce 
the complexity of the magnetic energy landscape (and the dynamics in that 
landscape) and allow one to get a better insight into the essential structures. 

In this paper we focus on the mathematical analysis of the statics and 
dynamics of magnetization structures in thin films. For a broader review of 
recent developments, written for a more general science audience, we refer to 
[DK*05]. 


1.2 The micromagnetic energy and associated variational problems 


We first discuss the functional from which all our results are ultimately de- 
rived, the micromagnetic energy. This energy is a sum of terms of various 
types. Depending on certain material parameters and on the shape and size 
of the ferromagnetic sample, any of these terms can play a dominant role, 
or an interplay between several of them can take place. This explains the 
multitude of different patterns derived from this theory. 

We consider an open domain 2 C R? which represents the ferromagnetic 
body that we study. The magnetization of this body is given by a vector field 
m : N — R®. Below the Curie temperature, the magnetization is saturated, 
which means that m is of constant length. We use a normalization such that m 
has values in the unit sphere S?. Now we consider several energies associated 
to m. 

The so-called exchange energy models the tendency towards parallel align- 
ment of neighboring magnetization vectors in the underlying atomic lattice. 


It is given by the functional 
2 
as I |Vm|? dx. 
2 
R 


Here d is a material constant, called the exchange length. 
The ferromagnetic material may have crystalline anisotropies which prefer 
certain directions of m. An integral of the form 


Q | om) ax 
Q 
represents such anisotropies. Here ¢ : S? — [0,00) is a fixed function and Q 


is another material constant. Usually ¢ is assumed to be smooth, often even 
a polynomial. 
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The magnetization induces a magnetic field, often called the stray field 
or demagnetizing field, which obeys the static Maxwell equations. It can be 
represented by a potential u : R? — R which solves the equation 


Au = div(yem) in R. 


Here xg is the characteristic function of 2 (in other words, we extend m 
by 0 outside of 2). If 2 is bounded, the saturation condition guarantees that 
m € LP(N, R?) for every p € [1, co]. Hence there exists a unique solution of this 
equation in the Sobolev space H'(R*). The induced field is then represented 
by —Vu. Its energy is called the magnetostatic energy and given by 


1 
> f Ra dx. 
2 Jre 


The interaction with an external field h : R? — R? induces an energy term 


- f| hem ax 
R 


that prefers alignment of the magnetization with the external field. In this 
paper, applied fields appear as driving forces in the context of moving domain 
walls. 

The micromagnetic energy is the sum of all four energies, that is, 


E(m) = F f maxo f oo) acts f, Vul? dx- f nmax. (1.1) 


The exchange term is of leading order in this functional, but the constant in 
front of it is typically small. The other three terms are of order 0, but one 
of them (the magnetostatic energy) involves the non-local pseudo-differential 
operator V AT! div. In some situations, its behavior is quite different from the 
behavior of the other terms. Under certain conditions, these energies may be 
in competition with one another. For instance, the exchange energy favors con- 
stant magnetizations, whereas the magnetostatic energy prefers vector fields 
which are divergence free (in R*). Because of the jump at the boundary, the 
two conditions cannot be satisfied simultaneously. The anisotropy term, on 
the other hand, may not penalize a varying vector field in principle, at least 
not if the function ¢ has several minima (which is usually the case), but it 
favors rapid transitions between different states — unlike the exchange energy. 
An analysis of such interplays can explain some of the observed patterns in 
ferromagnets. 

We study two types of variational problems associated to the micromag- 
netic energy. Minimizers of E, or more generally, local minimizers and (stable) 
critical points represent the stable magnetization patterns of our ferromagnet. 
If we write 

Vr: E(m) = -d Am + QV¢(m) + Vu-h (1.2) 
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for the L?-gradient of E (without the saturation constraint), then these vari- 
ational problems give rise to the Euler-Lagrange equation 


(l1-m@®m)V72E(m) = V;2E(m) —(m-Vz2E(m))m=0 in 2. 


(Here 1 denotes the identity (3 x 3)-matrix.) That is, the projection of 
Vz2E(m) onto the tangent space TS? vanishes. This equation can also be 
expressed in the form 


(Am + |Vm|?m) — QV¢(m) + (l1—-m®m)(h- Vu) =0 in @. 
Moreover, we have homogeneous Neumann boundary conditions 


o 
=> =0 on ôN. 
A model for the dynamical behavior of the magnetization is given by the 
Landau-Lifshitz equation 


om om 

— mA Vzr:-E&E(m m /\ —=0 1.3 

ao + ym A Vz E(m) + am A = 0, (1.3) 
also called the Landau-Lifshitz-Gilbert equation. Here A denotes the vector 
product in R. Both y and a are fixed constants, and we require that ay > 0. 
Another common way to write the equation is 


= = 4m AV ,2E(m) + âm (mA V,2E(m)). (1.4) 


The two versions are equivalent for 


X Q A 
Ogee A 
The terms in (1.3) and (1.4) with coefficients y and 4, respectively, describe 
a magnetic precession. We call them the gyromagnetic terms. The terms with 
coefficients a and â, respectively, are damping terms (hence the sign condition 
on ay, giving rise to the condition â > 0). From the mathematical point of 
view, the damping terms are the more important ones, because they make 
the problem parabolic. Without them, the equations would be of the type of 
a nonlinear Schrodinger equation. 

Taking the vector product with m in all terms of (1.3), we obtain a third 
equivalent version of the equation, 


„ôm ôm 
a + mA = = (m8m -— 1)Vz2E(m), R 


where 
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Representing the equation in this form is convenient because it underlines the 
similarity to the negative L?-gradient flow for the functional E subject to the 
constraint |m| = 1. (In fact this gradient flow is (1.5) for & = 1 and ¥ = 0.) 
We will normally use the Landau-Lifshitz equation in the form (1.5). 

It is natural to impose a homogeneous Neumann boundary condition also 
for the Landau-Lifshitz equation. 

Apart from the obvious quantities d, Q, ¢, and h, the qualitative behavior 
of E and of solutions of the above variational problems also depends on the 
shape and the size of the sample 2. Some idea of the dependence on the 
size can be gained by studying the scaling properties of the four terms which 
contribute to the micromagnetic energy. Suppose for a number A > 0, we 
replace 2 by AQ and the vector field m by m(a/A) (and similarly h by 
h(a/A)). Then the exchange energy is multiplied by the factor À, whereas 
the other energy terms are multiplied by \3. Thus it is to be expected that 
for a very small sample, the exchange energy determines the behavior of m 
to a large extent; that is, a minimizer of E is nearly constant. For a very 
large sample, on the other hand, the exchange energy is insignificant, and the 
behavior of m is ruled by the other terms. 

On the other hand, we can use rescalings to eliminate one of the parameters 
in our problem. Replacing 2 by AQ, and replacing simultaneously d by Ad, we 
obtain a functional whose energy landscape differs only by a constant. This 
way we can normalize the problem such that either d or Q become 1, or such 
that 2 has unit size. 


1.3 Thin films 


The results in this paper are concerned with ferromagnetic bodies in the shape 
of thin films. That is, we consider domains of the form 


Qs = N x (0,6), 


where (2 is now a two-dimensional domain and 6 > 0 is small compared 
with the size of 2. We either study the limit behavior of the micromagnetic 
energy and its variational problems as ô N 0, or we use the thinness of 25 
as a justification for working directly in two dimensions and with an energy 
that approximates the micromagnetic energy for thin films. In both cases, the 
projection of m onto the plane R? x {0} and the third component of m play 
different roles. It is therefore convenient to use the notation m = (m, mg) for 
the magnetization vector field, where m = (m1, mz). Similarly we often write 
x = (x, x3) = (£1, £2, £3) for a generic point in RÌ. Sometimes, however, it is 
more convenient to use coordinates (x, y, z) in R3. 

The reduction to two dimensions — whether by a rigorous asymptotic anal- 
ysis or formally — decreases the complexity of the problems that we study. 
Nevertheless, a rich variety of patterns can still be observed, and there exist 
different asymptotic regimes for the thin-film limit which give rise to different 
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reduced theories. These regimes are determined by certain relations between 
the parameters involved in the problem, of which we have now four (under 
the assumption that the shape of £2 is fixed, but the size can still be varied by 
scaling): In addition to the material constants d and Q, we have the thickness 
ô and a length scale L of the cross-section 2. When we are not interested 
in the behavior of m near 02, we may assume 2 = R?, and then L need 
not be considered. If we choose to neglect certain terms of the micromagnetic 
energy, this may of course reduce the number of parameters further. For in- 
stance, if we consider only the exchange energy and the magnetostatic energy 
(which we do in a substantial part of this paper), then the asymptotic regime 
depends on the behavior of the ratio d/L as we let the aspect ratio 6/L con- 
verge to 0. Some asymptotic regimes for this thin-film limit have been studied 
by Gioia and James [GJ97]; Carbou [Car01]; DeSimone, Kohn, Miiller, and 
Otto [DK*02]; and Kohn and Slastikov [KS05a, KS05b]. Another regime is 
discussed in this paper, first through a simplified model in two dimensions, 
then by an asymptotic analysis for the micromagnetic energy on Ns for ô N 0. 
This theory also establishes a link to the theory of Ginzburg-Landau vortices, 
which were first studied by Bethuel, Brezis, and Hélein [BBH93, BBH94]. 

A further dimensional reduction is made in the context of parametrized 
domain wall models, that we investigate in detail. Such models represent the 
basic building blocks within larger domain patterns or more complex domain 
wall structures. Of particular interest is the regime of weakly anisotropic (soft) 
thin films, where such transition layers significantly differ from those more 
common in phase transitions. 


2 Domain Walls: Internal Structure and Dynamics 


The primary phenomenon that one associates with magnetic pattern forma- 
tion is the decomposition of a magnetic body into almost uniformly magne- 
tized regions. The so-called magnetic domains are separated by thin transition 
layers, called domain walls, that interact in a complex network. The structure 
of such domain walls is among the central concerns of micromagnetic theory. 
While the analysis of domain walls is mathematically an interesting matter 
of its own, the physical relevance relies in the resulting mutual interaction 
having large impact on the global magnetic microstructure, especially when 
nonlocal effects dominate. In reduced thin-film theories, domain walls often 
emerge as line singularities while fine structural properties no longer have any 
effect. Breaking the resulting degeneracy by means of transparent selection 
principles rising from higher order contributions remains a major challenge. 
On the other hand, magnetic domain walls can exhibit internal substructures 
themselves or can be made up as a complex composite, such as the cross-tie 
wall, cf. [HS98] pp. 240-241. 

The simplest domain wall patterns are one-dimensional and appear as extreme 
cases in a hierarchy of domain wall models that emerge in diverse parameter 
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regimes: Within a Bloch wall, the magnetization vector performs a rotation 
perpendicular to the transition axis. The main feature is the avoidance of mag- 
netic volume charges, so that this wall type is energetically favorable in bulk 
situations and essentially equivalent to the transition problem arising from 
Cahn-Hilliard models. Such models exhibit sharply localized and rapidly de- 
caying transition profiles. Our analysis shows that this behavior can largely 
change when nonlocal interactions dominate and internal length scales fail to 
be determined by dimensional analysis. The Néel wall, where the transition 
proceeds in-plane, is preferred in suitable thin-film regimes and characterized 
by the avoidance of magnetic surface charge. The presence of three energy 
components with different scaling behavior gives rise to multiple length scales. 
The typical feature of a Néel wall is the very long logarithmic tail of tran- 
sition profiles. Such behavior has been predicted by heuristic arguments and 
numerical simulation (cf. [RS71, Gra99]) in order to explain long-range in- 
teraction of Néel walls, when neighboring tails overlap. Here we demonstrate 
rigorously how the main analytical feature of the variational principle, a crit- 
ical regularity property, gives rise to the typical logarithmic decay behavior 
[Mel02, Mel03, Mel04a]. This global approach served in addition to resolve 
the spatial scaling laws in terms of all involved parameters and to derive a 
somewhat universal limiting profile that reflects the decay. 

The evolution of magnetic patterns in the presence of applied fields is closely 
related to the motion of domain walls. Gyrotropic domain wall motion is based 
on the Landau-Lifshitz-Gilbert (LLG) equations, that describe a damped pre- 
cession of the magnetization vector about the effective field, i.e. mathemat- 
ically a hybrid heat and Schrödinger flow for the free energy. An appropri- 
ate local description relies on the concept of moving fronts that propagate 
with constant speed. Traveling wave solutions for the associate LLG dynam- 
ics represent a natural dynamic counterpart to static domain walls. As it 
turns out they provide valuable insight into the mechanisms and properties of 
domain wall dynamics, where besides energetics and spatial structures, kine- 
matic quantities as wall mobility and wall mass come into play. 

Whereas in the equilibrium case the magnetization path is dictated by ener- 
getics and particularly stray-field interactions, a second mechanism effects the 
shape of a moving domain wall: the precession dynamics as prescribed by LLG 
pushes the magnetization vector away from its optimal path, taking into ac- 
count a gain in stray-field energy. Many interesting effects originate from this 
balance of energetic and dynamic forces, especially when enhanced by strong 
shape anisotropy in the regime of thin films. The bulk situation, however, can 
surprisingly be solved explicitly by means of a famous construction by Walker, 
i.e. a tilted version of the Landau and Lifshitz solution for the standard Bloch 
wall, cf. [Wal63]. Again all spatial and temporal scales can be read off by 
dimensional reasoning. The natural question on whether such a construction 
can be perturbed to the regime of finite layers has been answered in the 
affirmative [Mel04b]. Indeed, a suitable choice of canonical coordinates trans- 
forms the associate LLG system into a weakly coupled Schrédinger/reaction- 
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diffusion system and makes it accessible for spectral methods. The analysis 
also demonstrates that the finite layer perturbation is indeed a singular one 
and how this relates to slow decay. 

In the regime of thin films the competition between stray field and preces- 
sion is singular, that is to say, the asymptotically hard constraint of in-plane 
magnetization is geometrically incompatible with LLG. In order to derive an 
effective evolution equation, the change of spatial scaling has to be accompa- 
nied with a change of times scale. An effective thin-film limit for LLG with 
finite Gilbert damping has been carried out in [EG01, KS05a] and in Thm. 4.9, 
where the gyromagnetic precession term effectively turns into a large damp- 
ing term as well. While the overall relaxation dynamics is captured correctly, 
oscillatory phenomena, such as spin waves or domain wall resonances, are sup- 
pressed in such a limit. In order to account for these effects we consider the 
complementary regime where Gilbert damping is comparable to the relative 
thickness. We show that in this regime LLG keeps its oscillatory features and 
turns into a damped nonlocal wave map equation [CMO06]. In the context 
of domain wall motion it provides a mechanical analogy and sheds some new 
light on the notion of wall mass. For small applied fields, the traveling wave 
problem, modeled on this wave-type dynamics, reduces to the question of 
linear stability for stationary Néel walls. Then the implicit function theorem 
provides existence and determines the mobility of traveling Néel walls. 


2.1 Mathematical framework for planar domain walls 


Let us consider an infinitely extended uniaxial magnetic film that is repre- 
sented by Ns = R? x (0,6) and oriented by the anisotropy (easy) axis R ê2. 
We consider parameterized transitions along R é, (that we call transition axis) 
that connect antipodal states on the easy axis, i.e. 











m:R-—S? with m(+oo) = (0,+1,0). 


In the following we denote the transition parameter by x and the vertical 
coordinate by z, i.e. we set x] = x and x3 = z. Under the hypothesis that, 
within the film, the magnetization varies only along the transition axis, we 
identify the associated global magnetization field m(x) = m(x)x(0,5)(z) that 
is defined for x € R?. Then m = m(x) induces the stray field Vu determined 
by the potential equation Au = V -m in D’(R*). We observe that u = 
u(x, z) where the z dependence only stems from shape anisotropy. Thus the 
micromagnetic energy induces the following averaged domain wall energy per 
unit length: 


E(m) = af fe f mp a+ f Vu:mdo+Q f0- m3) ae} dz. (2.1) 


The vertical average in (2.1) is redundant for the exchange and anisotropy 
portion. In order to perform a dimensional reduction for the stray field, we 
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introduce a reduced stray-field operator 
6 
Ss : m -f Vudz where Au=V-m in D'(R°). 
0 


Changing the order of integration, the averaged stray-field energy can be 
expressed as 


1 fê 1 
Bay) = 5 f [vuma dz=3 f Ss(m)-m de. 


A straightforward calculation shows that the operator Ss has an interpretation 
in terms of Fourier multiplication operators. Indeed, we have 


S5(m) = [o(6D) m,,0, (1 — o(6D)) ms| :R > R, 


where o(D) f = F* (£ = o(€)f(€). The basic Fourier multiplier o(€) is given 
by 


(2.2) 


o(é) = (1 o1 oo x ta for low frequencies £ 


Ig | 1 for high frequencies €. 


The reduced stray-field operator can equivalently be described by means of 
convolution kernel, cf. [Gra99], [Mel03] for a derivation and a detailed discus- 
sion. Accordingly, the reduced stray-field energy can be written as 


Eal) = 5 f OGO ag + 5 f (1 o(88)) brs) ag. (23) 


The advantage of the Fourier representation is that one can easily read off 
the asymptotic form of interaction from the asymptotic behavior of Fourier 
multipliers. From (2.3) one can separate a local contribution and a nonlocal 
one that vanishes in the bulk regime 


Bacay tn) = Z mlet f (1-168) {hs OP — bul? } dg. 2.4) 


Indeed, from (2.2) we deduce that o(ô £) — 1 in the regime when 4|£| — ov. 
Thus, for a corresponding family of transitions m so that mı and mg are 
uniformly bounded in L?(R), we infer that 


Esray(m) = 5 |lmil[32 + 0(1). (2.5) 


We observe that in the bulk regime, the stray-field interaction reduces to 
a local contribution having the form of an additional anisotropy term that 
penalizes magnetizations that point along the transition axis. In the comple- 
mentary thin-film regime when 6|£| — 0 we have o(0€) — d|€| and, for mı 
and m3 uniformly bounded in H1(R), an asymptotic expansion 
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1 ô 
Esray(m) = 5 ||ms|| ža + 7 llmallinre + 08), (2.6) 


where || fla = J IE|| f(€)|2 dE denotes the homogeneous H!/?-norm. The 
zero order contribution can be interpreted as the residual surface charge in- 
teraction having the form of an additional anisotropy that penalizes vertical 
magnetizations. The first order term corresponds to residual volume charge 
interaction. 

From a variational point of view the leading order stray-field contribution in 
(2.5) and (2.6), respectively, determines asymptotically a geodesic magneti- 
zation path. Whereas in the bulk situation the stray field interaction can be 
eliminated completely by choosing a path perpendicular to the transition axis, 
i.e. mı = 0 (Bloch walls), the penalty on the vertical component as 6 — 0 en- 
forces in-plane rotations, i.e. ma = 0 (Néel walls), taking into account internal 
stray fields that typically appear to the leading order, see Fig. 2.1. 

Complete elimination of stray-field interaction cannot be achieved by 
means of one-dimensional transition modes. In somewhat thicker films, how- 
ever, the symmetric Néel wall would lead to comparatively large stray-field 
contribution. But for an attempt to construct a stray-field free transition layer 
one has to abandon the symmetry assumption and to permit variations in the 
vertical direction. Such an object, referred to as asymmetric Bloch wall, 
has been discovered by Hubert, cf. [HS98] pp. 245-249, where volume charges 
are avoided by a vortex construction in the wall center. At the same time nu- 
merical simulations have confirmed a dramatic decrease of energy by breaking 
the wall symmetry. Recently, a rigorous verification based on an ansatz-free 
interpolation argument has been provided by Otto in [Ott02]. 


Bloch 
Walker 


Neel 


Fig. 2.1. The figure shows the Bloch and the Néel wall path as perpendicular 
geodesic connections of antipodal states. It also shows an intermediate geodesic 
that corresponds to a moving domain wall in the bulk regime where dynamic forces 
lead to an inclination towards the Néel wall path, the so-called Walker path. We 
will refer to the polar angle y between the Bloch wall and the Walker path as the 
Walker angle. 
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Bloch walls versus Néel walls 


The infinite Bloch wall in bulk samples has been the first micromagnetic ob- 
ject proposed and calculated in the seminal work by Landau and Lifshitz, 
cf. [LL35]. Once the stray-field energy is fully eliminated by choosing an ap- 
propriate path, the corresponding optimal profile and minimal energy can be 
found by nowadays standard variational sent Indeed, from (2.1) we get 
for m = (0, m2, m3) : R > S? with m2(+o0 1, 


A de et 


The length scale w = ,/d?/Q defines the typical Bloch wall domain width. 
Rescaling by w and renormalizing the energy by the factor 4,/d?Q yields 


=i [im'P ac +2 fO- m ax 


Using the identity |m’|? = (m4)? /(1 — m3) we deduce the optimality relation 























my =1—m3 with me(too)=+1 (2.7) 





that is uniquely solved by m2(x) = tanh(«). Thus in the original scaling 
m(x) = (0, tanh(x/w),sech(x/w)). (2.8) 


Moreover we deduce from (2.7) and Young’s inequality that 
1 i 
E(m) > 5 f |b] de > 1, 


that is attained under equipartition for mə(x) = tanh(x). Thus we re- 
cover from scaling that the Bloch wall energy per unit length is given by 
eo = 4,/d?Q. One may wonder whether the Bloch wall path is indeed opti- 
mal; but this is a simply consequence of |m'|? > (m4)?/(1 — m3) that holds 
true for any m : R — S? € H! and Young’s inequality that imply the same 
lower energy bound. 

Twenty years later, Louis Néel realized that in a regime where the film thick- 
ness becomes comparable to the Bloch wall width, a transition mode within 
the film plane can lower the total energy decisively, cf. [Née55]. New ideas, 
however, had to be developed in order to provide a satisfactory analysis of this 
multiscale object. Indeed, for an in-plane rotation m : R — S! the thin-film 
approximation of (2.1) yields 


d? ô 
=F [mP ds+t fielimig?ac+S f Pas (29) 


Unlike the Bloch wall problem where only two energy components remain that 
can be balanced by a single length scale, the Néel wall problem incorporates 
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two characteristic length scales. Those are connected with the competition of 
two energy components, respectively: In order to highlight the competition 
between stray field and anisotropy we rescale by the tail width w = 6/(2Q). 
With the small aspect ratio Q = 4K? Q where k = d/ ~ 1, we obtain the 
following singular perturbation problem 


Q 1 1 : 
Balm) = 2 pala + Ele + Hala min 2.10) 
m=(u,v):R—S' with u(0)=1 


that captures the logarithmic decay behavior as we will see below. There is 
a second characteristic length scale that is smaller than the tail width and 
related to the width of the core in the very center of the Néel wall. In the 
above regime it coincides merely with the exchange length d, and rescaling 
yields an expression 


1 1 Q 
Limba + cellule + Zlob 


that, as Q tends to zero, highlights the competition between exchange and 
reduced stray-field energy. 


2.2 The logarithmic tail of Néel walls 


The main analytical feature of the variational problem (2.10) is that the en- 
ergy gives only uniform control of the H!/?-norm as Q tends to zero. Since the 
H‘/?(R) norm just fails to control the modulus of continuity, the pointwise 
constraint u(0) = 1 is delicate and one might expect a logarithmic singu- 
larity in a renormalized setting. Logarithmic tails of Néel walls have indeed 
been predicted by heuristic arguments and the resulting very long range in- 
teraction between different Néel walls has important consequences, cf. [HS98] 
pp. 242-245, [RS71] with some extensions in [Gra99]. Logarithmic scaling for 
the energy has recently been established in [Gra99] and the following refined 
version is announced in [DK*99]: 


Theorem 2.1. As Q tends to zero the minimal energies behave like 
inf Eo = 5 (1 + 0(1)) n(1/Q)7* 


where the infimum is taken over transitions that are admissible according to 
(2.10). 


Similar scaling laws have been derived in the case of periodic Néel wall 
arrays, where tails are confined by those of neighboring walls, cf. [DK*03]. 
They were used to heuristically quantify their mutual repulsive force, that is 
particularly interesting in the context of optimal spacing for the cross-tie wall. 
It is remarkable that, in case of finite Néel walls, the above energy asymptotics 
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in Thm. 2.1 holds true when the infimum is taken over y-periodic transitions 
m = m(x, y), cf. [DKO05]. Here, the quality factor Q is replaced by the aspect 
ratio ô/w, i.e. film thickness by tail width, so that Q = 4k? x. The proof is 
based on a dynamic system argument and a sharp interpolation inequality 
between L® and BV. The result proves in particular (nonlinear) stability of 
the one-dimensional Néel wall with respect to two-dimensional variations in 
the plane, a result yet unknown for infinite Néel walls. 

The proof of Thm. 2.1 is based on a perturbation argument; its shows 
that minimal energies exhibit the same asymptotics as the minimal energies 
for the relaxed problem (2.12) to be introduced below. It turns out to provide 
an pointwise logarithmic lower bound as well and motivates the main result 
[Mel03, Mel04a]: 


Theorem 2.2. Let ug be a minimizing profile for the variational principle 
(2.10). Then ug is symmetric-decreasing and exhibits a logarithmic tail in 
the sense that 
uola) = Pa forall OS aS1 and 0< Q< 1/4. 

The notations a S b and a ~ b mean that, for some universal constant 0 < 
c < œ we have a < cb and +b < a < cb, respectively. It can also be shown 
that beyond the logarithmic tail a Néel wall profile decays only quadratically 
In(1/Q) ug(x) ~ £7? as |x| — œœ, cf. [Mel02], a fact that is related to the 
limited regularity of associated Fourier multipliers. Renormalization yields in 
addition a universal limiting profile that captures the essential decay behavior: 


Theorem 2.3. For any sequence Q — 0 so that the corresponding sequence of 
renormalized profiles Ug = \n(1/Q) ug converges in the sense of distributions, 
the weak limit Uo is a multiple of the fundamental solution of the operator 
(—A)!/? + 1, and the convergence is strong in L?,.. 


For fractional derivatives of order s > 0 we use the notation (—A)*/? f = 
Free els f(), where, for s = 2, we have (—A)f = ah. The fundamental 
solution G € L?(R) with (—A)!/?G + G = ôo is well known as a Fourier 
integral. It is smooth away from the origin, symmetrically decreasing and has 
the following expansion: 


aa) = 2 fees SOG wees 


m | £7? + O(a-*) as |x| — oo, 


(2.11) 
where y denotes Euler’s constant. 


Logarithmic lower bounds 


We introduce a linear comparison problem arising from relaxation that can 
be solved explicitly. It is shown in [Mel04a] that relaxation of (2.10) leads to 
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% Q 1 1 ae 
E(u) = Sellen + zler + stelle: — min in {u(0)=1}. (2.12) 


A standard convexity argument implies the existence of a unique minimizer 
that satisfier the Euler-Lagrange equation 


Q(—A)u* + (—A)!/2u* + u* = A(Q) ðo in D'(R). (2.13) 


Expanding the associate Fourier multiplier into partial fractions, (2.13) can 
be solved in terms of the fundamental solution (2.11), so that the following 
properties can be read off: 


Proposition 2.4. The unique solutions ug of the relaxed variational princi- 
ples (2.12) exhibit logarithmic tails in the sense that for 0 < Q < 1/4: 


Uo(z) = Me) (1+ 0(1)) [n(1/z)+roa(x)] for Q<a<1/e, 


where the functions rg(x) are uniformly bounded in the above regime. The 
Lagrange multiplier A(Q) agrees with twice the minimal energy and has the 
asymptotic behavior (cf. Thm. 2.1) 


A(Q) = r(1 + 0(1)) n(1/Q)™*. 


Surprisingly, the relaxed variational principle not only provides an ener- 
getic but also a pointwise lower bound, thus, in view of Proposition 2.4, a 
logarithmic lower bound. 


Proposition 2.5. Let ug be a minimizing profile for variational principle 
(2.10). Then the solution uG of the relaxed variational principle (2.12) is a 
pointwise lower bound. 


Proof. The idea is to derive suitable pseudo-differential inequalities for the 
profiles to be compared. We observe that |m’|? = |u’|?/(1 — u?) and deduce 
the following Euler-Lagrange equation 


of-4 (=) + (a) ef autu=o (2.14) 


that holds true for a Néel wall profile u in {u 4 1}. By Proposition 2.8 this 
equation holds true in R \ {0}. Now the essential ingredients are symmetric 
convexity of comparison profiles stated in Proposition 2.4 and the following 
global maximum principle for the nonlocal field operator (—A)!/?: 


Lemma 2.6. Suppose that the function u € H! (R) is smooth in R \ {0} and 
that u attains a global maximum at zo # 0. Then (—A)'/?u is smooth in a 
neighborhood of zo and (—A)'/?u(ao) > 0. 
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We consider w = u* — u € H! (R) with w(0) = 0. From (2.13) and (2.14) we 
deduce, with the positive coefficient a(x) = Q/(1 — u? (x)), that 


a(x)(—A)w + (4) Pw+w<0 in R\ {0}. 


Since w is smooth away from the origin, the Lemma 2.6 applies and excludes 
a global maximum in R \ {0}. But from Propositions 2.4 and 2.8 we infer that 
w(x) decays as |x| — oo, and the proof is complete. 


Logarithmic upper bounds 


The key observation is that logarithmic upper bounds are captured by sharp 
elliptic regularity bounds that are uniform in Q. For magnetizations mg of 
bounded Néel wall energy (2.10) we have ||wgl|7j1/2 < Eo(ma). Then Sobolev 
embedding implies, for any p € (2,00), a bound |\ug||?, < c(p)Eo(mo). 
A PDE argument, however, shows that for any such p the purely energetic 
argument misses the optimal scaling by a full factor Eg(mq) and provides in 
addition an estimate on the growth of optimal constants. Qualitatively, the 
same is true for fractional Sobolev norms Hi /? that are strictly weaker than 
H2., 


Proposition 2.7. For a critical point mg = (ug, vo) of (2.10) we have 
lluoll <cpEg(mq) for each p€ (1,0) (2.15) 
for some universal constant c > 0 and 
voll a <c(q) Eo(ma) for each q € (1,2), (2.16) 
where the constant c(q) > 0 only depends on q. 
Proof. We outline the main steps: Projection of the Euler-Lagrange system 
VEo(m) = VEo(m) m m (2.17) 


onto its first component equation yields, after a suitable decomposition of the 
non-linearity, an equation of the form 


Q(—A)u + (4)! u + u = eo|m] u + r[u], (2.18) 
where eg|m] = Q |m |?+|(—4)!/tu|?+ |u]? is twice the energy density and rfu] 


is a defect distribution arising from the in compatibility of nonlocal interaction 
and the geometric constraint |m| = 1. In fact, for any test function y, we have 


(riul, p) = ((-A)/4u, [(-4)"4, upu). 


The operator on the left hand side of (2.18) is uniformly first-order elliptic, 
while the right hand side is essentially L1-bounded by the energy, and in that 
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case the claim would follow from a simple Fourier argument. Commutator 
estimates, however, show that uniform bounds for r[u] are slightly weaker 
than L! and rather distributional, i.e in H” for q € (1,2). By means of 
elliptic regularity theory we get a uniform bounds in Hi / 5 and the claim 
follows from asymptotic inequalities for fractional integration. 


From strict rearrangement inequalities that are valid for fractional Sobolev 
norms and a simple bootstrap argument based on the Euler-Lagrange system 
(2.17), we deduce the following symmetry and smoothness result for optimal 
profiles u = ug. 


Proposition 2.8. A Néel wall profile is smooth and symmetrically decreasing. 


Remark 2.9. The above proposition is strict in the sense that a Néel wall 
profile m = et? cannot have a plateau at 0, and the associate phase function 
0 is strictly increasing, cf. Lemma 2.18. 


Proof of Thm. 2.2 


In view of Propositions 2.4 and 2.5 it remains to prove the logarithmic upper 
bound. Let u = ug be a Néel wall profile. Proposition 2.8 implies that the 
pointwise values are below the local averages. Thus Holder’s inequality and 
Proposition 2.7 yield 


z x 1/p 1\ 1⁄2 
0 < u(x) <f u dy < (f |u|? dv) <cp (=) inf Eg 
0 0 x M 


that is a family of upper bounds parameterized by p. The pointwise optimal 
choice of p, given by p(x) = In(1/x), and Thm. 2.1 yield the logarithmic upper 
bound. 


Proof of Thm. 2.3 


From Thm. 2.1 and Proposition 2.7 we deduce that Ug is uniformly bounded 


í 1 
in, say, H, 


strongly on bounded intervals. From the profile equation (2.18) we get 


19 (R), so we can assume that Ug — Up weakly in L?(R) and 


Q(—A)Ue + (—A)"?Ug + Uo = In(1/Q) (elma uo + r[uel): 


Obviously, the left hand side converges to (—A)!/2Ug + Up in the sense of 
distributions. Thus it remains to show that 

(i) the distribution (—A)!/?Uo + Uo is supported at the origin 

(ii) the distribution In(1/Q)r[ug] converges to a finite measure. 
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Claim (i) can be deduced from (2.14) using the uniform pointwise convergence 
of Néel wall profiles to zero away from the origin. Claim (ii) follows from an 
iteration of the arguments in of Proposition 2.7. Indeed, for u = ug we have 
a decomposition of the remainder distribution (r[u], ¢) = (f[u], p) + (g[u], 9), 
where 


( flu], p) = ((~4)" ĉu, [(-A)*/4, up), 
(gļu], p) = ((-A)"/4u, u? (4) tp). 


Product and commutator estimates for fractional derivatives (cf. [Hof98, 
KP88]) and the energy estimate in Thm. 2.1 show that 


(ffl. o) < e [ll zeleli < e 10/97 [lel co- 


Thus we find that the contribution coming from In(Q) f [u] is asymptotically a 
finite measure. On the other hand, according to the Hy! * estimate in Propo- 
sition 2.7, 


[ales o)l Se [elle lll cellll aay2llell nave < ¢ MA/Q)~* [loll ar 


3/2 


so that the contribution from In(Q)g[u] vanishes as a distribution as Q tends 
to 0. 


2.3 Domain wall motion in finite layers 


When an external magnetic field h = H ê is applied that points towards 
the easy axis, the end-states are no longer equally preferred. Consequently, 
one expects the domain wall to become unstable and start to move. In gy- 
rotropic domain wall models the evolution of magnetization distributions is 
characterized by the Landau-Lifshitz-Gilbert equation 


mAQm+aGjm-+y7(l-m®@®m)VE(m)=(l-m@®m)h (2.19) 
m:R x (0,00) > S? with m(+oo,t) = (0,1,0) for te (0,00). 











where £(m) is the internal domain wall energy. We introduce the aspect ratio 
k = d/ô and assume for simplicity that Q = 1. Renormalizing space and 
energy by the exchange length d, we get from (2.1) and (2.4) an (internal) 
domain wall energy of the form 


=} | imi? + 5 [= m3) des 5 [mt de + Gam ), (2.20) 


where the nonlocal portion of the stray-field energy is given by 


Gem) => f (1—o(ne)) (jms? — fm?) ae. e21) 
2 
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Regarding the dynamic problem, a special class of solutions to constant 
coefficient systems are traveling wave solutions, i.e. solutions of the form 
m = m(x+ ct), that describe a motion of constant speed c. In the case 
k = 0 the traveling wave ansatz turns (2.19) into a constrained nonlinear 
system of ordinary differential equations. Surprisingly, these equations can be 
solved explicitly. The solutions are referred to as Walker’s exact solutions, see 
[Wal63, HS98]. Our goal is to show that this situation is indeed generic and 
can be perturbed to layers of large but finite diameter. As this corresponds to 
the case of small nonlocal interaction, the nonlocal character of the equations 
will play a minor role. We will show that, in suitable coordinates, domain wall 
motion according to Landau-Lifshitz dynamics fits into the context of nonlo- 
cal, weakly coupled reaction-diffusion systems. But first we review Walker’s 
construction. 





Walker’s exact solutions 


For « = 0 we have a transition energy 
1 1/2 1 2 1 2 
Eo(m) = > |m"] dx + 5 (1 — mg) da + 5 f/m dx. 
The traveling wave ansatz m = m(x + cyt) yields the system 
cam’+cmAm/+(1—m@®m)VEp(m)=(1—m®m)-h. (2.22) 


The above system can be solved explicitly, a calculation that has been first 
carried out by Walker. The original calculations become more transparent, 
when the equation is considered in the canonical orthogonal frame {m’,m ^ 
m’} on the tangent space of S? along m. It turns out that the following 
assumptions can be met: 


(I) Under the assumption that dissipation compensates the driving force, the 
system decomposes into three equations, 


ca|m’/? = Hm, VEo(m)-m’=0, and VEo(m)-mAm/ = c|m'|?. 


(II) Under the assumption that the wall moves with constant polar inclination 
angle y, i.e. m A w = |m'|v for some constant unit vector v, the energy 
is the Bloch wall energy with increased anisotropy Q(y) = 1 + sin? y, 


Eo(m) = 5 fep dx + oe) fa — m) dz. 


Moreover |m'|? = (m4)? /(1 — m3) and on = |m’|v holds for such m. 


We deduce that, up to scaling, all equations in (I) have the form |m’|? = 
1 — m3, ie. m} = 1 — mî, and can be solved jointly. Matching parameters 


gives the transition profile, inclination angle, and propagation speed 
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= 1+sin2 a, dega aie SOc g 

ma(a) = tanh [y 1+ sin? p 2], sin(29) = —, and c ee 2) 
Obviously a peak velocity ~ 0.4 (i.e. about ~ 10024 for a typical garnet ma- 
terial) is reached at for finite field-strength H beyond which the construction 
breaks down. 
Walker’s construction shows that the dynamics of domain walls in bulks 
samples is accompanied by a decrease of domain wall width by a factor 
(1 + sin? p)? and an increase of domain wall energy by the inverse fac- 
tor. From Walker’s construction we deduce that the first order correction for 
domain wall energies vanishes at small velocities: 


1 
en = eo (1 +sin? p)? = e0 +3 Me +0(c’). 


The second order correction can be viewed as a kinetic energy contribution. 
Thus, the factor M = ey at c= 0 is referred to as the wall mass, a notion 
that has been introduced by Döring, cf. [Dor48]. In the original scaling the 
wall mass is given by M = e/(2d?). We will encounter the wall mass again 
later in the context of traveling Néel walls rising naturally from a wave-type 
interpretation of Landau-Lifshitz-Gilbert dynamics in thin films. 


Stability and perturbation of Walker’s construction 


Theorem 2.10. For sufficiently small field strength H there is a threshold 
k(H) > 0 such that, whenever k < K(H), there is a traveling wave for the 
Landau-Lifshitz-Gilbert dynamics that connects antipodal states. 


Proof. We perform a stability analysis based on a suitable choice of canonical 
coordinates that transforms (2.19) into a weakly coupled 2 x 2 system of 
reaction-diffusion type. For this purpose we combine standard stereographic 
coordinates with a polar rotation by the Walker angle y that maps the Walker 
path into the Bloch wall path. In these coordinates C 3 z + m{z] € S?, 
the Walker path is given by the line segment zo : R — {0} x [-1,1]. For 
m = mfz], functional gradients transform according to \~?(z)V2E,(m) = 
VE,,(m), where A(z) is the conformal factor. Moreover, (2.19) becomes a 
damped Schrodinger equation 


aðiz + 10:2 — y Droz +7 (F(H, 2) + blr, 2)) =0, (2.24) 











where z(-,t) : R — C with z(+00,t) = (0,41). The mapping 
1 
f(H, z) = X7? (2) vaf [50m — m2) — Hmo] dz for m = mfz] 


involves anisotropy, applied field, and the limiting (local) portion of stray- 
field interaction. For small enough H, it is bi-stable in its second component. 
With the notation in (2.21), the map b(K, z) = \~?(z) V-G,.(m) is a nonlocal 
perturbation from stray-field interaction that is continuous in « with b(0, -)=0. 
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Remark 2.11. An important observation is that the mapping «K +> b(k,-), con- 
sidered as a family of nonlinear operators on suitable function spaces, is not 
differentiable at « = 0. Indeed, 


d 1 1 
T Ev |o a z lmallr-2 = z lmlz- 


with singular behavior at low frequencies that conflicts with slow decay prop- 
erties in the presence of internal stray fields. The perturbation at k ~ 0 is 
therefore singular, and only continuous versions of the implicit function the- 
orem are at our disposal. 


Finally, 
Dzôzz = 0?z + T(z) (ðzz, 0,2) 


denotes the second covariant derivative. Surprisingly, the form of Walker so- 
lutions remains almost unchanged: 


zo(a) = (0. tanh È (1+ sin? p)? 2|) 


Recall that the associate propagation speed co inherits the Walker angle y, so 
for each H we identify the Walker solution with the pair (zo, co). Introducing 
a moving frame x |> g + cyt, (2.24) reads like 


G((z,¢), K) = —DzOez + c(a + i)ðsz + f(H, z) +b(K, z) = 0. 


It turns out that our choice of canonical stereographic coordinates provides 
an almost triangulation for the linearized problem. Its spectral properties can 
be summarized as follows: 


Proposition 2.12. For sufficiently small field strength H the linearization at 
the Walker solution (zo,co), has the form 


oG Lı M: 
E ((eove0).0) = lig 28] i HPC) > RO, 


where Lı : H? (R) — L? (R) has a bounded inverse while Lz and its L?-adjoint 
have zero as a simple eigenvalue with eigenfunctions vg = Imzġ and Yo, 
respectively, so that the integral f vo Yo dx > 0 exists. Moreover, ||M,|| can be 
made arbitrarily small by choosing H small. 


Since z6(x) = 44|,=020(a + A) can be seen as the infinitesimal generator of 
translation symmetry, the proposition suggests that degeneracy only stems 
from translation invariance. Thus we introduce the extended functional equa- 
tion 


G((z,¢c),«) = [G((z,¢),n),1m 2(0)| = (0,0). (2.25) 


Its linearization with respect to (z,c) at the Walker solution (zo, co) has the 
form 
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Li Mo —vp 
Lo = Mı Lo avo 
0 do 0 


In view of Proposition 2.12, the invertibility of Lo : H?(IR;C)xR — L?(R;C)x 
R for sufficiently small H would follow from a Schur-type argument once we 
have shown invertibility of the 2 x 2 matrix on the lower right. But this follows 
from a standard Fredholm argument (cf. the proof of Thm. 2.15), taking into 
account Proposition 2.12 and the positivity of vj. Now the continuous version 
of the implicit function theorem implies the solvability of (2.25) for sufficiently 
small x > 0. 


2.4 Domain wall motion in thin films 
A wave-type limit for Landau-Lifshitz-Gilbert 


Gyromagnetic precession is geometrically incompatible with the asymptotic 
constraint of in-plane magnetization that is imposed by stray-filed interaction, 
in other words, the competition between energetic and dynamic forces becomes 
singular in a thin-film limit. Thus domain wall motion in thin films should be 
governed by a suitable effective limit for LLG 


Oom+ymAVE(m)+amA jm = 0 (2.26) 


as the relative thickness 6/d tends to zero. We recall that Gilbert damping a: is 
a small parameter as well, that is to say, precession proceeds much faster than 
relaxation. Prior work on thin-film reductions for LLG, leading to enhanced 
dissipation, cf. [EGO1, KS05a] and Thm. 4.9, consider the regime when 6/d < 
a. In order to preserve the oscillatory features of LLG dynamics we take into 
account small Gilbert damping as well. As it turns out, the effective dynamics 
depends on asymptotic regime as a and the relative thickness 6/d tend to 
zero. Rescaling space by the tail width w = 6/(2Q) and energy by the quality 
factor, we get, for 3-dimensional transitions m = (m, m3) : R > S?, a domain 
wall energy of the form 


Q 1 1 
E.(m) = Eom) += | m? de + limite (2.27) 


where Q = 4K?Q and £? = Q. The in-plane portion of the energy Eo(m) is 
given by 


Q 1 2 1 
Eo(m) = > J Im]? + 5 [mill + ae — m3) dz. (2.28) 


For in-plane magnetizations, it agrees with the standard Néel wall energy that 
we have considered before. We investigate the regime when € — 0 while Q 
is uniformly bounded from above and below; in other words € ~ 6/d can be 
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considered as a relative thickness. Let us consider the associated LLG equation 
in the asymptotic regime when a(e)/e — v. Rescaling time by </7, the system 
(2.26) becomes 

m+emAVE-(m)+amA mm; = 0. (2.29) 


Theorem 2.13. Let m, : R x (0,00) — S? be a family of global solutions 
of (2.29) with uniformly bounded initial energy E-(m,(0)) < c. Suppose that 
a(e)/e — v and the in-plane components me — m converge locally in L?. 
Then 

[07m + v ðm + VEo(m)] Am = 0. (2.30) 


Proof. We let m = (m, ev), i.e. we blow-up the vertical component. Then the 


energy can be written as E:(m) = Eo(m)+G.(v), where G.(v) = 4 f e?|v'|?+ 


|u|? dx. For ve = a(e)/e, the Landau-Lifshitz-Gilbert system (2.29) can be 
written as 


0 =v m 
E 
Or C) + |v 0 -m E n ) =0 
v ee ae elv) + E?Ve Ov 


The energy inequality implies the requisite a priori estimates 
T 
ve f lðrmelĝ2 + Eome (T)) < Be(an(0)), 
0 


T 
ev i livella + Ge(ve(T)) < Ee(m,(0)), 
0 


and passing to the limit yields the following set of equations 


ðm — vmt =0, 
buv + mt. [VE(m) + v ôm] =0. 


From that system the vertical blow-up function v can be eliminated. Indeed, 
the first equation is equivalent to 0,v = 02m-m+. Substitution into the second 
equation yields the result. 


Remark 2.14. Under further regularity assumptions, especially the validity of 
the energy inequality, the asymptotic limit holds true in higher dimensional 
situations as well. 


Traveling waves and kinematic properties of Néel walls 


The latter asymptotic limit suggests the following dynamic model for the evo- 
lution of Néel walls in thin films under the influence of a constant applied field 
h = Hê» that points towards one of the end states determined by anisotropy: 
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(d?m + v ðm + VEo(m)) Am=haAm, (2.31) 
m:R x [0,T) S! with m(+oo,t) = (0, +1). 











Representing the transition vector in polar coordinates m = e’®, the tran- 
sition energy becomes 














1 
£0) = S WP ax + 3l cos Bl for (+00) = +5, 


where || f||3;1;2 = f (1+1€|)|f(€)|? dé denotes the full H'/? norm incorporating 
anisotropy and stray-field interaction. Then the reduced dynamic equation 
(2.31) reads 
820 +v 80 + VE(0) =h-ie®, (2.32) 
6:Rx [0,T) —R with 0(-+00,t) = (0,+7/2). 











The latter damped wave dynamics invites for a kinematic interpretation 
for the wall center as a point mass with constant force and dynamic friction. 
The argument will be rather informal; asymptotically, however, the kinematic 
findings will be justified rigorously in the context of the traveling wave result 
below. Indeed, if H is assumed to be suitably small, the moving phase 0 = 
6(a,t) is presumably close to the stationary phase profile 69 shifted by q(t), the 
center of the wall at time t. Hence we make an ansatz 0(x, t) = 0o (x, t)+01 (x,t) 
where, with a slight abuse of notation, 0o(x, t) = 0o(x + q(t)) and 6;(z,t) is 
assumed to be a small perturbation. Then we approximate 


VE(0) = VE(80) + L001, where Lo = DVE(00), 


and cos(0) = cos(99) — sin(9@9)0@1. Now if 0ı is a solution of the linearized 
problem 070; + vô;0ı + Lobi + H sin(9,) 61 = 0, then 


96 lgl? + 00 G+ 4 = H cos ðo. 


Thus, the associate momentum p(t) = M q(t), where M = 4 f |0|? dx can 
be interpreted as the wall mass, satisfies the equation p + vp = H with 
terminal momentum p* = H/v. We observe that M is consistent with the 
wall mass we encountered in section 2.3 rising from the infinitesimal increase 
of energy. Accordingly, the mobility, i.e. the rate of change of propagation 
speed with respect to H, is given by 3 = 1/(Mv), consistent with our rigorous 
perturbation result: 


Theorem 2.15. For sufficiently small field strength H there is a traveling 
wave for the reduced Landau-Lifshitz-Gilbert dynamics 


ĉo" + cv + VE(0) = H cos 











that connects antipodal states at infinity @(400) = +1/2. Moreover, the prop- 
agation speed has an expansion c = BH + o(H) where the wall mobility 
B = 1/(Mv) is related to the wall mass M = 5 f \06|? dx taken from a 
stationary Néel wall 60. 
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Proposition 2.16. Suppose that Oo is a critical point of E(0) subjected to 
center and boundary conditions 0(0) = 0 and @(t00) = +7/2. Then the 
Hessian 











F 2 
DEl) o) > f Ple de + esinda > 0 
is non-negative for any admissible variation yp. 


Proof. Let (f,g)m1/2 = Re f (1+ lel) F(E) G6) dé be the H'/? inner product. 
Then 


D*E(9)(v, p) = of Iv"? — (cos 0, y? cos 8o) paya + || sin Ollia- 


In order to estimate the middle term, we deduce from the Euler-Lagrange 
equation 


o fo (y? cot Ao)’ dæ = ( cos ĝo, p? cos 99) j71/2- 
Recalling that —d(cot 60) /d0 = 1 + cot? 0, then the claim follows immediately. 


The Proposition states in particular that any critical point of E is in fact 
the phase of a minimizing Néel wall. 


Corollary 2.17. For centered Néel walls 0o, the linearization Lo = DVE (00) 
extends to a self-adjoint operator on L? (R) having zero as a simple eigenvalue 
with eigenspace spanned by 09. 


We need the following refinement of Proposition 2.8 that in particular rules 
out a plateau of Néel wall profiles: 


Lemma 2.18. The phase 9 of a stationary Néel wall is strictly increasing. 


Now the proof of Thm. 2.15 follows closely the one carried out in [BF*97]. 
We let G((0,c), H) = c? 0" +cv 6’ +VE(0)—H cos and consider the extended 
functional equation 


G((0,c), H) = [G((0,c), H), 0(0)] = (0,0). 


Observe that for a stationary Néel wall 0) and 0 = 09 + 60, the mapping 
((0,c), H) ++ G((0,c),H) is smooth. The linearization with respect to the 
first two components (0, c) at the stationary Néel wall (60,0) reads like 


— Lo v 0b 
c= ea, 


As a mapping Lo : H?(R) x R > L?(R) x R it has a bounded inverse, i.e. 
for every (f,b) € L?(R) x R there is (¢,c) € H?(R) x R so that Low + 
v6,c= f and ¢(0) = b. According to the Fredholm alternative and Corollary 
2.17 the first equation is solvable provided cv f |00|? dx = f foh dx that 
fixes c and determines ¢ up to a multiple of 6). But in view of Lemma 2.18, 
the second equation provides uniqueness. Now the implicit function theorem 
implies the existence of a differentiable branch H +> (0(H),c(H)), so that 
c'(0) v f |00|? dx = 2H, and the claim follows. 
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3 Boundary vortices in a 2D model 


In this section, we present results on a specific thin-film limit of the magnetic 
energy for a special regime of rather small films. We will analyze a limit of 
a two-dimensional functional derived by Kohn and Slastikov [KS05b]. They 
considered soft magnetic films without an external field, which corresponds 
to a functional that uses only the exchange and magnetostatic terms. They 
studied the asymptotic behavior of the corresponding version of (1.1) in a thin 
film N; = N x (0,6) with diam Q = 1 for 6 0 and net —> £ € (0,0). 


The energy divided by 2x5" then I’-converges to the limit functional 


FETE sti f mv (3.1) 


defined on maps m € H!(92, S1) (so mg = 0). Here v is a unit normal to 02. 
The Kohn-Slastikov theorem shows that for this special scaling, the nonlocal 
contribution arising as the energy of the induced field reduces to a local term 
charging the boundary. The reason for this simplification of the functional 
lies in a separation of scales between the energy contribution of volume and 
surface charges to the field energy. 

In the following, we present results of Kurzke [Kur06, Kur04, Kur05] on 
the limit of (3.1) as € — 0, for a simply connected domain 92. The results 
share some features with those of Moser [Mos03, Mos04] that are presented 
in Section 4. In particular, sequences of minimizers develop vortices on the 
boundary. 

Due the two-scale process of first letting 6 — 0 to obtain (3.1), a two- 
dimensional problem, and then letting € — 0, our approach can be seen as a 
simplified model for the boundary vortices in Section 4. Since our functional 
here is local, the results are more detailed, especially for the asymptotic dy- 
namics. 

The sequence of functionals Ekg is rather similar to the Ginzburg-Landau 
functional for superconductivity of [BBH94]. With mo = 7 being a continu- 
ous unit tangent field to 0, we are (after rescaling and renaming variables) 
considering the variational problem for m : R — R?: Minimize 


1 2, 1 2 1 
= ea P< tines 
zf ym + 5s ps (m : mo) )dH 


subject to |m| = 1 in Q as € > 0. This problem has an interior constraint and 
a boundary penalty. 
Bethuel, Brezis and Hélein [BBH94] studied the behavior as € — 0 of 


1 2 1 2)\2 
2 aes, i 
zf VP+ zee f 0- mP) 


subject to m = mp on OM, so this problem has a boundary constraint and an 
interior penalty. 
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Common to both problems is that, as long as mg has nonzero topological 
degree, there is no map in H!(Q;R7) that satisfies the constraint and makes 
the penalty term zero. This is due to the fact that a continuous map w : 02 — 
S! can be extended to a continuous map U : 2 — S! if and only if deg(w) = 0. 
Although H+ maps need not be continuous, the argument still carries through 
to show that there is not even an extension of finite H+ energy. Both problems 
are thus forced to develop singularities as € — 0, and the minimum energy 
will become unbounded. 

We call the singularities of both problems vortices, since minimizers con- 
verge as € — 0 to maps that have the form a near the singularities a;. 
In the Ginzburg-Landau case, these vortices are interior and each carries a 
topological degree of 1; in the Kohn-Slastikov case, the singularities lie on the 
boundary, and we only see one half of the vortex. Each “boundary vortex” 
corresponds to a transition from mp to —mo or vice versa, and can be viewed 
as carrying $ topological charge. 

It was shown in [Kur06, Kur04] that a single boundary vortex carries an 
energy of $log + (see Thm. 3.3), and that the interaction of these vortices 
is governed by the next order term in the energy expansion, a renormalized 
energy that can be calculated by the solution of a linear boundary problem 
(see 3.6). In [Kur05] it was shown that this renormalized energy actually 
governs the motion of the vortices in the natural time scaling, when time is 
accelerated by a factor of log + (see Thm. 3.19). 

In the following, we will explain these results in more detail, and in the 
proper frameworks of two orders of J’-convergence and I’-convergence of gra- 
dient flows. 

A major advantage of the simplified energy (3.1) is that the problem can 
be made scalar since m € S! can be written as m = et”. The energy functional 
can then be rewritten as 


1 1 : 
E-(v) = sf |Vo|? + ae sin?(v — g), (3.2) 


where g is a function with ie’? = v. Since Q is simply connected, the degree 
of v as a map from ôN (which is homeomorphic to $+) to St is 1, and so g 
needs to have a jump of height —27, but can otherwise be chosen as smooth 
as ðN. As £ — 0, minimizers ve of Es will now satisfy sin? (ve — g) ~ 0 on most 
of 022, but due to the jump of g, this will not be possible everywhere, and so 
singularities will develop that correspond precisely to the fast transition from 
mT toms —T. 

We can obviously generalize this to g having a jump of height —27D, 
D € Z, corresponding to deg(e’?) = D, which will we do in the following. 
Without loss of generality, we will assume D > 0. The magnetic case of the 
Kohn-Slastikov functional is given by D = 1. 
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3.1 Highest order asymptotics 


The following calculation gives an upper bound for the energy of a single 
boundary vortex. We will use a localized energy €-(v; B) that is defined by 


1 1 
E-(v; B) = J) Jive v|? + ie sin?(v — g). (3.3) 
n 


To simplify matters we will assume that 2N Br(0) = B} (0) is a half-ball and 
that g = 0, corresponding to the constant tangent. We set Tr = 02M Br(0) 
for the flat boundary which we assume to be part of the z-axis in z = x + iy- 
plane. 





Proposition 3.1. There is a sequence ve € H'(Br(0)) with ve|rR > Vs = 
TXæ<o in all LP (IR) for 1 < p< œ and 


R 
Elve; Br) < + log = +0. (3.4) 


Proof. Set ve = arg(z) in B$ (0) \ B4 (0). Choose any H* continuation w with 
0<w<aof arly p+ (0) to BY (0) and set ve(z) = w(ez) inside BY (0). This 
sequence obviously satisfies the claims. 

This shows that for R = O(1), a typical vortex has an energy of approximately 
Z log + + O(1). A combination of 2D such vortices to counter the jump of g 


leads to the following upper bound: 


Proposition 3.2. Minimizers ve of Es satisfy 
1 
Elve) < TD log = +C(2). (3.5) 


A different interpretation of “every vortex carries an energy of 5 log = i” is 
given by the following I’-convergence theorem: 


Theorem 3.3. Assume (ve) is a sequence of functions with E-(ve) < M log +. 
Then there exists a sequence of as E€ 27Z such that the boundary traces we = 
(ve — az)|aq are bounded in an Orlicz space of type e}, and in particular, 
Ilwellre(aa) < C(M). 

The sequence ws is then precompact in the strong topology of L'(0Q), and 
every cluster point w satisfies w — g E€ BV(02;7Z). In addition, we have the 
lower bound inequality 


lim inf 


e>0 2log at J eP 2 i. |D(w — g)|- (3.6) 


Conversely, for every u with u — g E€ BV(02;7Z) there exists a sequence 
ue E€ H!(Q) such that the trace satisfies ue > u in L1(OQ) and with 


DE p fv Ue|? = ih |D(u — 9)I. (3.7) 
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We will not prove 3.3 here. A proof based on a nonlocal representation using 
the H'/? seminorm and rearrangement inequalities is given in [Kur06], and 
an extension to higher dimensions via slicing that utilizes the Orlicz bound 
is shown in [Kur05]. Both proofs are based on ideas of Alberti-Bouchitté- 
Seppecher [ABS94, ABS98] for similar functionals with a coercive instead of 
periodic potential. 


Remark 3.4. Theorem 3.3 shows that, since the BV type limit functional has 
a lower bound of 7D, the energy of minimizers of Es is 7D log ++ o(log +). For 
other converging sequences, we obtain just the number of jump points with 
multiplicities, but the limit functional is independent of the position of these 
jump points. 


3.2 Separation of vortices and renormalized energy 


We will relate the dependence of the energy on the position of the singularities 
to a renormalized energy given as follows: 


Definition 3.5 (Possible limit functions). Let d; € Z with X); di = 2D 
and a; E€ ORQ be distinct points. We define the canonical limit function v, = 
vs(a;,d;) to be a harmonic function with sin?(v, — g) = 0 such that its trace 
on ON jumps by —1d; at the point a;i. 

The renormalized energy is defined to be 


1 1 
idi) =z li P- ?log= |. ; 
W (ai, di) 5 (f y 7rd vet) (3.8) 


The renormalized energy can be expressed via the solution of a linear bound- 
ary value problem for the Laplacian, see [Kur04]. 

With the renormalized energy, we can formulate the following second-order 
I’-convergence type theorem. For minimizers, we will have N = 2D as above. 


Theorem 3.6. If (vz) is a sequence of functions with E-(ve) < M log + and 
Ve — Us(a;,d;) in L?(OQ), with di € {+1} and a; distinct, i=1,...,N, then 





N 1 
lim inf (e) = = (ios = +1—log 2)) > W(a;, di). (3.9) 


e—0 


If additionally (vz) are stationary points of Ez, then ve > vx in W1?(Q) for 
p< 2 and in Hj, away from the ai, and (3.9) holds with equality. 

Furthermore, for any di E€ {+1} and a; distinct, i = 1,..., N, there exists 
a sequence of functions we such that we > vs (ai, di) and 





e—0 


lim (ewa = ™ (x : +1 —log 2)) = W (ai, di). (3.10) 
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We will prove this theorem only partially, for the special case of minimizers, 
where the bounds and the convergence follow from a comparison argument. 
The general case can be shown by means of some extra PDE estimates and a 
regularization technique of Yosida type, replacing the sequence by an improved 
sequence that minimizes a modified functional, see [Kur05]. 


Proposition 3.7 (Euler-Lagrange equations). Stationary points ve of Ee 
satisfy the equations 





Av. =0 in, (3.11) 
Weiss Sagi ) ondQ (3.12) 
a z” ve— g) onol. ; 


Lemma 3.8 (Rellich-PohoZaev identity). For a Lipschitz domain G and 
a harmonic function v € H?(G), there holds 


P(e) = >| z-v|Vol?. (3.13) 
ag OV aG 


Proof. This is easily seen using by testing Av = 0 with z- Vv. 
An easy consequence is 


Lemma 3.9. For a starshaped Lipschitz domain G, there exists constants 
such that every harmonic function v € H?(G) satisfies 


2 
a EA =, 
ac aG aG 


Following ideas of [BBH94] and [Str94], we relate the penalty term + fzo sin?(v— 
g) to a radial derivative of the energy: 


2 2 


Ov 
OT 


Ov 
Ov 


Ov 


- (3.14) 




















Definition 3.10. For zo € ON and v € H?(Q) we set 


A(p) = Avez (P) = of [Vo] dH! + ej sin? (v — g)dH°. 
3Bp(z20)NN E JAB,(z0)NOL 
(3.15) 


For stationary points of the energy, A can be used to bound the penalty term: 


Proposition 3.11. There exists co > 0 and C > 0 such that for all € < £o, 


p<e«/*, any stationary point v of Ez, and any zo € N there holds 
1 
=| sin?(v — g) < Alp) + CVE. (3.16) 
2e Tp(zo) 


Proof. For simplicity, we show this only for g = 0 and a flat boundary. We 
use Zo = 0 and apply (3.13) on the domain wp = 92 N B,(0), which shows 
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1 7 7 dv |? ðv 
= z- v| = a — —z:-Vv. 3.17 
2 Jaw, | aB,na | OV r, OV oan 
Using the Euler-Lagrange equations, we obtain 
2 
Lana ESE 
= =p —| —— | sin2v(z- Vv). 3.18 
2 JaB,na aB,na | OV 2e Jr, (R 
Integrating by parts, we see that 
1 | - 2 p n2 o_ 1 : 
— | sinf(v)= — sin“ (v)dH* — — | sin2v(z- Vu 3.19 
2e Jr, 2€ JoB,nan 2e Jr, ) oe) 
mais sin?(v)dH® + a |Vo|? — ef 2 i 
2€ JoB,nan 2 JaB,na OB,NQ Ov 
(3.20) 
1 
< z4). (3.21) 


We obtain the following criterion for vortex-free parts of the boundary: 


Proposition 3.12. There exist constants y > 0 and C > 0 such that for every 
zo € OD, € < £o, p < €3/4 and every stationary point v of Ee with A(p) < y 
there holds 


1 
sup sin?(v — g) < = (3.22) 
Tp/2 4 
and i 
= sin?(v — g) < C. (3.23) 
2e Tig 


Proof. By Lemma 3.9, we can estimate 


2 
f <C [Vol]? + cf 
T, 3B, N2 T 


We thus can estimate, using Sobolev embedding in one dimension 
Ov 

2 <C | — 
elcossecr,) S r, [Or 


1 1 


P 


2 
<C 


Ow p 


2 


Ov 


OT 


Ov 


Ae (3.24) 


Ov 
Ov 




















2 








sin?(v — ») < £ (27 + Gye). 


Assuming now that sin? (u(z) — g(z)) > 4 for some z € T,/2 and choosing y 
and £o sufficiently small then leads to a contradiction. 
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Disintegrating the energy radially, we see 


Lemma 3.13. Let (v-) be a sequence of stationary points of E satisfying 
the logarithmic energy bound Ez(v~-) < M log . Then for any zo € ON, the 
function A(p) = Aov,,e,29(p) defined above satisfies 





84 
p)< Bee Ez(Ve; 20 Bes6(Z0)) < 84M (3.25) 
(= 


inf 
28/7<p<e5/6 


and 
inf A(p) < 60M. 3.26 
age ak (p) < (3.26) 
Using Vitali’s covering lemma, we can use this and (3.16) to show a local 
upper bound on the penalty term near an almost singularity and in a second 
step a covering of the set of almost singularities. This leads to 


Proposition 3.14. There is a constant N = N(g,2,M) such that for any 
sequence of stationary points vs satisfying the energy bound E.(v-) < M log +, 
the approximate vorter set Se = {z € OQ: sin?(v-(z) — g(z)) > 4} can be 
covered by at most N balls of radius £, such that the €/5 balls around the same 


centers are disjoint. 


For comparison arguments we shall need the following lower bound for the 
energy on half-annuli whenever v — g is in different wells of sin? on both parts 
of the boundary: 


Proposition 3.15. Let 0 < p < R < Ro, Ro sufficiently small, zo € ON, 
w.l.0.g. zo =0. We examine the “half-annulus” Dr p = (Br\ Bp) N 2, which 
can be described by choosing functions 01(r),02(r) as {re : Wi(r) < 0 < 
Vo(r),p <r < R} with |Va(r)—Vı(r)— r| < Cr. Assume also that for j = 1,2 
there holds (v — g)(ret’s"")) € (kya — 6, kya +ô) for some kj € Z and some 
small 6. Then 
Ev; Drp) > F(a — ki)? log — Olka — hy )*(R + Ê). (3.27) 
Proof. For simplicity, we will assume g = 0, 0; = 0 and v2 = 7, corresponding 
to a flat boundary. We will set v;(r) = v(ret®+) for the function on the two 
boundary components. We also assume w.l.o.g. kı = k and ko = 0. Using 
polar coordinates, disregarding the radial derivative and by use of Hölder’s 
inequality, we calculate 
R T 2 
f |Vul? > n - | 
Drip p Jo 


OUT agar 


Ov 


EL (f 








ov 
Ov 


joi fee 
TJs r 
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We rewrite vı — v2 = kr — (vı —kr)— v2. Using the lower bound sin?(t— kir) > 
ot? valid for |t| < 6 with some o = a(6), we can estimate 





R o 2 
E-(v; Drp) = J) ier ((v, — kr) v2))” | s(t | v3) dr. 


2 Tr 


On the last term, we use the inequality v? +v3 > $(v, —km+v2)?. Then we use 
the inequality a(A— B}? + 86B? > my’ on A= kr and B = (v1 — kr — v2). 
The claim then follows by integration. 


We now recall that ve are minimizers of € satisfying the upper bound 
1 
Elve) < TDlog A + Co (3.28) 


for some constant Co, where D is the degree of e’9. We will use an appropriate 
lower bound for the energy away from the vortex set to show convergence by 
a comparison argument. The same arguments also hold for stationary points 
satisfying (3.28). 

By Proposition 3.14, there exist aj € 02,1 < j < Ne < N such that 
the approximate vortex set S- satisfies Se C U< j<Ne B-(a5). Passing to a 
0 


subsequence of € — 0, we can assume that Ne = No is constant and aj > a; 


as € — 0. Note that the a} need not be distinct. We define for 0 < o < 


3 ming 249, dist(a},a},) the sets 25 = 2\U; Bo (af) and = 2\U; Bs (a8). 


With this setup (and this subsequence), the lower bound of Proposition 3.15 
can be combined with the arguments of Struwe [Str94] to show 


Proposition 3.16. There is a constant C = C(g, R, Co) such that Es (ve; 25) < 
TD log + +C. 


We obtain convergence to the canonical harmonic function: 


Proposition 3.17. Let (ve) be a sequence of critical points satisfying the en- 
ergy bound 


1 
E-(uz) < TD log F + Co. 


Then there is a subsequence and N = 2D points a,,...,an € OQ such that 
| [Vue]? < M(2’) < œ (3.29) 
QQ’ 
for all open Q' with QD C Q\{ay,...,an}. Additionally, there hold the bounds 


i Vel? < C(p) (3.30) 
2 


uniformly in € for all 1 < p < 2. In particular, after adding a suitable ze € 
2rZ, a subsequence of (ve) converges weakly in HL, and W'?, p < 2, toa 
harmonic function v«. The limit has the properties that (v, — g) is piecewise 
constant on ƏN \ {a1,...,an}, with values in TZ, and jumps by —r at the 
points aj. 
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Proof. We use the setup described above, in particular, we use the points a$ 
as defined there. Note that for € < €9(c), there holds 22 C RE J2 and so by 
Proposition 3.16, 


2 
| IWvel? < 2£(ve 26 a) < 2xDlog = +C, (3.31) 
29 


which proves (3.29). To obtain the L? bounds (3.30), fixao >Oand1<p< 2. 
Then by Holder’s inequality and Proposition 3.16, 


f vees f VeeS f [Vve]? 
Q NE =1 25 


-e\o 
68 p/2 
Oe Ie aig 2? ( f vet) 
l=1 itz 


> 1 
LC hey Be Pit (2D log. + c) 
A 2*0 


p/2 





ZO 


From this L? gradient bound, we obtain the weak compactness up to transla- 
tion. The weak limit v, is harmonic since te Vvs Vo = limeso So Vue Vg =0 
for all p € CX (Q). That the boundary values satisfy v. —g € TZ with possible 
jumps at the a; follows from fao sin?(ve — g) — 0 and (ve — g) being close to 
TZ outside the approximate vortex set Se. 

That the vortices are indeed single and N = 2D can then be shown by 
some refined arguments which prove that higher-order vortices must have far 
higher energy. 


The energy of these limit functions v, away from aq; is the renormalized energy 
of (3.8), and the energy of ve on 2, converges to that of v.. To prove equality in 
(3.9), we thus need to calculate the energy of ve close to a;. This is done by an 
e-scale blowup, which leads to a half-space problem. The solutions of the half- 
space problem are explicitly known and essentially unique (Toland [Tol97], see 
also Cabré and Sola-Morales [CS04] for a more general uniqueness theorem). 
Comparing ve with the rescaled half-space solution and some estimates (see 
[Kur04]) then show the rest of Thm. 3.6. One can even show 


Proposition 3.18. For a sequence ve of stationary points of Es, the configu- 
ration of vortex points (a;) is stationary for the renormalized energy W (aj, di). 
For minimizers ve, it is minimizing. 


3.3 Motion of vortices 


Theorem 3.6 and the previous proposition show that the renormalized energy 
W governs the interaction of the vortices on an energetic level. It can be shown 
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that also the motion of the vortices by the gradient flow (which corresponds 
to the LLG flow since we restrict possible magnetizations to a plane) is given 
by the renormalized energy: 


Theorem 3.19. Let 0 < T < œ and let (ve) be a sequence of solutions of 





AcOpue = Ave in Qx (0,T) (3.32) 
Ove 1. 
at oe sin2(ve—g) on 02 x (0,00). (3.33) 





For the initial conditions we assume that v-(0) > v.(a;, di) for di E€ {+1} and 
distinct a;. Furthermore, ve is supposed to be initially well-prepared, meaning 
that 


TN 


5 (1 — log 2) < W (a;i, di) + 0(1) (3.34) 


aN 1 
Ee(ve(0)) — a les a 
asé—> 0. 
Depending on the asymptotic behavior of Ae, we then have: 


(i) If Xe = BT then there exists a time T* > 0 such that for all t € [0, T*), 


there holds ue(t) > v,(ai(t),di(0))). Furthermore, the a;(t) satisfy the 
motion law d T 

Qi 

= —-— W (a;(t), d; (0 3.35 

at = = W(aslt)-di(0)) (3.35) 
in the tangent space at a; to ON. If T* < T is the maximal time with 
these properties, then as t > T*, there exist i # j such that a;(t) and 
a;(t) converge to the same point. 
The energy of ve(t) satisfies the expansion 








Ti log = | Tia log2) + W(a(t),d) +0(1) (3.36) 





Ee(ve(t)) = 


as € — 0. 

(ii) If Ac log + + 0 as € — 0, then for almost every t € [0,T) we have ve(t) > 
vx (a;(0), di(0)), so there is no motion. 

(iii) If re log + — œ as £ — 0, then for almost every t € [0,00) we have 
Velt) > vx(b;,d;) with VW (bi, di) = 0, so the system instantaneously 
jumps into a critical point. 


Again, there are strong similarities between this result for the motion of 
boundary vortices and those in the theory of gradient flow motion of inte- 
rior Ginzburg-Landau type vortices as studied by Jerrard and Soner [JS98] 
and Lin [Lin96a, Lin96b]. 

The proof in [Kur05] is based on the technique of -convergence of gradient 


flows of Sandier and Serfaty [SS04a], applied to the functionals 
N 1 
F(u) = E(u) — “(log = + 1—log2) (3.37) 
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and the limit functional 
F (ai) = W (ai, di). (3.38) 


We need some additional definitions: 


Definition 3.20. We say that functionals F© I’-converge to F along the tra- 
jectory u(t) with respect to the convergence «By if there exist u(t) and a 


subsequence such that for all t, ue (t) u(t) and 
lim inf F*(ue(t)) > F(u(t)). (3.39) 
E> 


The energy excess D.(t) and the limiting energy excess D(t) for a sequence 
ue(t) are defined via 


Dz(t) = E*(ue(t)) — E(u(t)), D(t) = limsup D.(t). (3.40) 
e—0 
If u(t) are solutions to the gradient flow for E€ that satisfy D(0) = 0, they 
are said to be initially well-prepared. 


The proof of Thm. 3.19 relies on the following version of Sandier and Serfaty’s 
theorem on the I’-convergence of gradient flows: 


Theorem 3.21 (Sandier-Serfaty [SS04a]). Assume F€ € C!(M) and F € 
C1(N). Let us be a sequence of solutions of the gradient flow for F€ on [0, T) 
with respect to the metric structure Xs that satisfy 


F#(ue(0)) — F*(ue(t)) = f Oete(s)|I%, ds. (3.41) 


Assume ue (0) uo, that F€ I'-converges to F along the trajectory uz(t), and 
that (ue) is initially well-prepared. Furthermore, assume that (LB) and (CON) 
hold: 


(LB) For a subsequence such that u-(t) u(t), we have u € H'((0,T);N) and 
there exists f € L'(0,T) such that for every s € [0,T) there holds 


imit f louel at> f (lalh yv- FDO) dt. (3.42) 


(CON) If u(t) u(t), there exists a locally bounded function g on [0,T) such 
that for any to € [0,T) and any v defined in a neighborhood of to that 
satisfies v(to) = u(to) and Q%vu(to) = —Vriq,vE(u(to)), there exists a 
sequence velt) such that ve(to) = ue(to) and the following inequalities 
hold: 


lim sup ||d:v-(to)[Ix, < lauto) 


v(to 


v + g(to)D(to) (3.43) 


lim inf (-4 
0 


rae dt 





Fo) =A, 





FOO) Ilo) Dlo). (3.44) 


t= 


284 M. Kurzke, C. Melcher, R. Moser 


Then uz(t) u(t) which is the solution of the gradient flow for E with respect 
to the structure of TN. 


This is applicable to our case since (3.32) with the nonlinear boundary con- 
dition (3.33) is the gradient flow of F® with respect to the norm Vz ||-||;2, 
which we will use as the spaces Xe in the terminology of the theorem above. 
The functionals F* are defined on M = H'(Q). As the sense of convergence, 
we use veČ (a;i) if ve —> v (ai, di) in L? (3N). The necessary l-convergence for 
Š follows from (3.9). 

The limit functional is defined on N = {(ai)i=1,...,N : a; Ż aj fori # 
j}, which is an open subset of the (flat) Riemannian manifold (02)%. The 
approach of [SS04a] for Euclidean limit spaces carries over to this situation 
without changes. As the limiting norm on the tangent spaces TaM which are 
identified with RY we use the constant Riemannian metric 4/3 ||-||pw- 

Theorem 3.21 allows us to break up the proof of Theorem 3.19 into two 
separate parts, a lower bound and a construction. The proof of the lower 
bound relies on an anisotropic version of (3.6) in higher dimensions. This 
leads to a product estimate like the one of Sandier-Serfaty [SS04b], which can 
then be used to separate space- and time-derivatives to show (3.42). 

The construction used to show the upper bound inequalities is done by 
taking a well-prepared sequence and “pushing” the vortices along the bound- 
ary with the flow of a vector field that is conformal close to the vortex. The 
conformality ensures that the highest order of the energy does not change 
by the flow. With some more detailed local estimates related to (3.9), the 
estimates (3.43) and (3.44) then follow, as is detailed in [Kur05]. 


4 Boundary vortices in a refined model 


In this section we discuss a thin-film regime that is related to the theory of 
the the previous section; indeed the theory of Section 3 can be regarded as a 
simplified version of what is to follow (but, as mentioned earlier, it can also 
be seen as an asymptotic analysis of a model arising in the thin-film theory of 
Kohn and Slastikov [KS05b]). We now examine the development of boundary 
vortices in thin films for a model that is closer to the actual micromagnetic 
model than the one discussed previously, although we still use some simpli- 
fications. In particular we consider now domains that are three-dimensional 
(but thin in one dimension) and magnetization vector fields with values in 
S? (not St). We continue to neglect the anisotropy term in the micromag- 
netic energy functional and the external magnetic field, but we consider the 
exchange energy and the magnetostatic energy in the form that they have in 
the functional E. 

Naturally, the problem becomes more difficult when we drop some of the 
simplifications. It is not surprising, therefore, that we need more assumptions 
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to obtain less information about the asymptotic behavior of the magnetiza- 

tion. But the results we find for this model are consistent with those for the 

more simplified model, which shows that the latter does indeed describe the 

significant features of the thin-film limit in the asymptotic regime we study. 
We consider the family of domains 


Ns = N x (0,6) 


for 6 > 0, where 2 C R? is open and bounded. We also assume that 2 is 
simply connected and that its boundary is smooth. The outer normal vector 
on 022 is denoted by v. The energy functional (without the anisotropy term 
and the external field) is then 


d? 1 
E(m) = oe A |Vm|? dx + 5 f |Vul? dx, 
ô 


where, as usual, the function u € H! (R?) is determined by the condition 
Au = div(ya;m) in R. 


We assume for the moment that the shape of N2 is fixed, whereas its size 
can still be varied by scaling. The problem then involves three length scales: 
the exchange length d, the thickness 6, and the length scale of the cross- 
section, measured, e.g., by L = diam 2. The asymptotic regime we consider 
is characterized by the condition that d? is of the same magnitude as Lô. For 
simplicity, we assume d? = Lô. Rescaling allows us to normalize L = 1, 
which gives rise to the relation d? = 6 between the exchange length and the 
thickness (and which means that 2 is fixed henceforth). 

Since we study the asymptotic behavior of variational problems associated 
to the micromagnetic energy as ô N 0, we now denote a generic magnetization 
vector field in 2s by mê = (mê, m) € H!(5,S*). The corresponding poten- 
tial for the induced magnetic field is then the unique solution uf € H!(R3) 
of 

Au? = div(xo,m?) in R3. (4.1) 


We also consider the maps 
1 rê 
m’ = =| m(a,s)ds, «EQ, 
0 


so that we can pass to a limit in certain spaces of functions on 2 (usually 
Sobolev spaces). The limit will then be a map m = (m,m) : 2 — S?. If 
we consider the functions 6~'u® and apply equation (4.1) to a test function 
o € C (R?), we can formally pass to the limit. We obtain (formally) a limit 
function u : R? — R with 


Vu-Vd¢dx = | m(x) - Vd(x,0) dx (4.2) 
R3 Q 
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for all ¢ € CF (R?). For the problems we consider here, we have typically 
ms = 0 in 2 and m-v = 0 on OM. If we have furthermore m € W!4/3(Q,S!), 
then equation (4.2) does in fact determine a function u € H1+(R?) uniquely. 
We will see that this function describes in part the limit of the magnetostatic 
energy. Also important is its trace on 2 x {0}. We denote this trace by uo. 

It is convenient to divide the micromagnetic energy by 67. That is, we 
consider the family of functionals 


82 82 
jaa |\Vm?|? at ore f |Vu’| dx. 
Critical points of Es satisfy the Euler-Lagrange equation 
6(Am? + |Vm°|?m°) — (1 —m? @ m°)Vu? = 0 in N3. (4.3) 


It is natural to impose homogeneous Neumann boundary conditions, i.e., 





3mê i 
PTA =0 ingx {0,8}, (4.4) 
3m 

ae = 0 on 0 x (0,6). (4.5) 


Stable critical points also satisfy 


m+ sw 
(eran) 2° 


for all y € C®(Ns, R?). Standard calculations transform this inequality into 


d 
ds? 








1 
o< f (Vnt P- h (Vm? +m? Vu’) d+ f |Vu>|? dx, (4.6) 
25 R3 


where rh? = (1 — m? @ mê). 
For the Landau-Lifshitz equation, there exist two interesting time scales, 
similarly as in the previous section. The first one gives rise to the equation 


Ə ô 
o = —4m? A (Am? — 6-!Vu®) — âm? A (m? A (Am? — 5-!Vu)) (4.7) 
in Rs x (0,T), which is equivalent to 


Om? 


ot 


Here we use the abbreviations 





1 
Rms = Am? + |Vm?|? m? — ab mê @ m°)Vu. (4.8) 








Rms X =aX+4m°AX and a@=- 
ar + 
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We normally use the form (4.8) of the equation. We impose homogeneous 
Neumann boundary data again, that is, 








ô 
m cg wox T, (4.9) 
0x3 
Om?® 
= = 9 on AM x (0,8) x [0, T). (4.10) 
V 


This is the time scale where we expect the development of stationary boundary 
vortices in the limit ô N 0, in analogy to the results of the previous section. 
To study the dynamical behavior of the vortices, on the other hand, we need 
to rescale the time axis by the factor log log 4 (accelerating the time by this 
factor). The equation then becomes 


3m? 
Saa = Am! +|Ym?|?mi- =(1—-m®@m)Vue in Q5x(0,T). (4.11) 
log log + ô 
The boundary conditions remain of the form (4.9), (4.10). 

We want to reproduce the asymptotic theory of the previous section for 
the model given by the energy Es and the equations (4.3), (4.8), and (4.11). 
There are several additional difficulties here, however, that we have to over- 
come. First, the target space for our maps is now S?, not S1, which means 
that mê can no longer be represented by a single phase function. The curva- 
ture of S? also has the consequence that we have to consider equations with 
nonlinear terms involving first derivatives of mê. Together with the fact that 
our domains 25 are now three-dimensional, this means that we must expect 
solutions of the equations with singularities. To simplify the presentation of 
the results, we always assume here that we have smooth solutions; but without 
this assumption, regularity is an issue that requires extra care. 

The most important new aspect of this model, however, is the nonlocal 
operator appearing in the magnetostatic energy. This is at first a major im- 
pediment to using the methods from the theory of Ginzburg-Landau vortices, 
for these methods require pointwise comparisons between integrands of the 
lower order energy terms. To overcome this difficulty, we compare Es with 
another functional that has only local terms, namely 


o 1 
S 





ô\2 l 1 
F5(m°) (vm? + a dx + 23 (mi. v)? dH. 


ô 28 Janx(0,8) 


Here H* denotes the k-dimensional Hausdorff measure. The functional F5 can 
be thought of as the three-dimensional equivalent of 


5)2 
G5(m°) = >| (vmt + = dx + nT (m: v)? dH’, 
Q 


where for the latter functional, we consider mê € H!(,S?). 
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The connection to the theory discussed earlier is obvious. The connection 
to the theory of Ginzburg-Landau vortices becomes even more apparent when 
one observes that (m)? = 1—|m°|? (since mê has values in the unit sphere), 
recovering thus an integrand in the first integral of Gs that is similar to 
the one used in most works on Ginzburg-Landau vortices. Less obvious is the 
connection between Es and Fs. Before we give any rigorous arguments, we look 
at this question heuristically. The magnetostatic energy seeks to minimize the 
divergence of yg,;m°, which consists of two parts: the divergence of mô in the 
interior of 25; on the one hand, and the distribution given by the perpendicular 
part of mê on 025 on the other hand. The latter further splits into two parts 
according to the natural decomposition of the boundary into 2 x {0,6} and 
ðN x (0,6). The part coming from divm? now gives a contribution to the 
magnetostatic energy which is of the same order as the exchange energy. Both 
of the other parts correspond to one of the terms in Fẹ. 

The next few lemmas give a more precise description of the relation be- 
tween these functionals. 


Lemma 4.1. For 6 € (0,e7°] and mê € H1(5,R*), the inequality 
|| Vue IZ ees) s CV5||Vin Ilaso) + Ol|m3(- Olio 


+0 f xoana ) volde) 


holds for a constant C that depends only on Q. Here uf € H!(RÌ) is the 
function determined by (4.1). 


Proof. We have 


| |Vu?|? dx = Vu>-m? dx 
R3 2 


6 
ô 
= f -f wimg de+ | | udm? - v dz dH! 
2x {5} 2x {0} 02 Jo 
5 
-f f 1è divm’ de des, 
0 Jo 


Now we use the continuous trace operators H!(R) — L4(Q) for every slice 
x {x3} and the continuous trace operator H!(Q) > L?(0Q, H'/?(—1,1)) to 


ô 
estimate the traces of u in these spaces. Furthermore, an integration of oma 
along vertical lines gives 


llm3(-,0) — mS(-, llZas) < VÕIV mll aso) 


and the desired estimate then follows from the Hölder inequality. 


Domain Walls and Vortices in Thin Ferromagnetic Films 289 


Lemma 4.2. For 6 € (0,e7°], letm? € H!(2,S') and m? (x, x3) = (mè (x), 0). 
Then 


1 
|| Vu? ll z2¢Rs) < Ce? CAPE + log 5 ||m? - vIRo0)) 


for a constant C that depends only on 22. 


Proof. A direct computation shows that the characteristic function \(0,5) of 
the interval (0,0) satisfies 


1 
IIx (0,5) ll 3/2 Ry S cô? (: + log 5) 


for a certain constant c which is independent of 6. The claim now follows 
directly from Lemma 4.1. 


Proposition 4.3. There exists a constant C, dependent only on N, such that 


: 1 
sage os Es < r log log 5 +C 


for 6 € (0,e7°]. 


Proof. We construct a map m° € H!(Q,S') with two standard vortices cen- 
tered at two different points on the boundary, similarly as in the proof of 
Proposition 3.1 (with € = 1/ log 4). For m° (z, £3) = (mê (x), 0), the estimates 
of Proposition 3.1, together with Lemma 4.2, give a bound for Es(m°) of the 
desired form. 


In fact the quantity 7 log log + gives also a lower bound for the infimum 
of Es in H'(5,S?) up to a constant. That is, it determines the asymptotic 
behavior of this infimum. The proof of the lower estimate is technically more 
involved; we therefore give only a sketch of the proof here. 


Lemma 4.4. For 6 € (0,e7°] and mè € H1(5,S*), the inequality 


| max <cfa | Vmi axt f [Vut ax +0 (4.12) 
Ns 25 R3 


holds for a constant C that depends only on 22. 


Sketch of the proof. We test (4.1) with a function ¢ € C°+(IR?) which is de- 
fined on 22 x R by 


0, if x3 < 0 or x > 26, 
olz, z3) = ¢ f° ma(z, s) ds if 0 < 23 < ô, 
(2 — 23/6) [f ma(a,s)ds if ô< x3 < 26, 
and extended suitably to R3. We recover the left-hand side of (4.12) as one of 


the terms in the resulting equation (after an integration by parts). All other 
terms can then be estimated with standard methods. 
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For s > 0 we now define the sets 


V, = {x EQ: dist(x, ðN) <s}, VI = V, x (0,6), 
r,={#xe2: dist(z,ðN) =s}, TË = T, x (0,6). 


We fix so > 0 such that T; is a smooth curve for every s € (0, 2sọ]. Moreover, 


we define 1 


«(6) = —. 
(9) log 4 

Lemma 4.5. There exists a constant C, depending only on 2, such that for 

every ô € (0,e7°] and every s € [0, so] with s < K(d), the inequality 


ô\2 
ig (m? -v)? dH? < of (vm + = dx 
ô TŠ Vis) ô 


+S] |Vu?|? dx + Cô (4.13) 
R3 


is satisfied for any m? € H!(5,S?). 


Sketch of the proof. The idea is to test (4.1) with a suitably constructed func- 
tion ¢ satisfying @ = mê - v on TÊ and supported on a K(d)-neighborhood of 
TÈ. An integration by parts on one side of the resulting equation then yields, 
among other terms, the left-hand side of (4.13) for s = 0. A careful estimate 
of the other terms gives the required inequality for I, ie To obtain the corre- 
sponding inequality for other values of s, integrate the derivative of (mê - v)? 
along rays in the direction of —v. 


Proposition 4.6. For any K € R there exists a constant C, depending only 
on 2 and K, such that the following holds. Suppose 6 € (0,e7*] and s € [0, so]. 
If m? € H1(5,S?) satisfies 
1 
Es(m°) < rlog log 5 +K, (4.14) 


then 





1 ô J2 ô\2 1 
Ble r3 5 bys, 


1 5/2 log § 6 A2 J2 

+5 |\Vu°|“ dx + — | (m°-v)* dH* < C(4.15) 
6? JRs ô Jrs 

Sketch of the proof. With the arguments from Section 3, combined with sim- 
ilar arguments from the theory of Ginzburg-Landau vortices, applied to slices 
of the form 2\V, x {t}, we obtain the estimate 
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2 2 
1 ðm? ðm? 4: (m)? oe 
26 Ns \VŽ xı Ox ô 

log + 





1 
f (m° - v)? dH? > mloglog = — C, (4.16) 
ô Jrs ô 
for a certain constant C1 that depends only on 22. Combining this with Lemma 
4.4, Lemma 4.5, and (4.14), we obtain the desired inequality. 


Proposition 4.7. There exists a constant C, dependent only on 2, such that 


1 
inf Es > r log log = — 
pa 5 > Tlog 08 5 C 


for ô € (0, e7°]. 


Proof. Choose a minimizer mê of Es in H!(Q5,S?), then (4.14) holds for a 
certain constant K by Proposition 4.3. Thus mê satisfies (4.15) and (4.16), 
and the claim follows. 


Proposition 4.3 and Proposition 4.6 together describe the asymptotic be- 
havior of the minimal energy as ô N 0 up to an additive constant, and it is 
the behavior we expect also for the functionals Fs or Gs. Moreover, if mô is 
independent of x3, we can estimate each term in Es by a combination of terms 
in Gs, and vice versa, according to Lemmas 4.1—4.5. This already relates the 
asymptotic regime studied here to the model used in Section 3. We discover 
more similarities, however, when we study the asymptotic behavior of critical 
points of Es or solutions of the Landau-Lifshitz equations (4.8) and (4.11) 

If we have a family of solutions mê of one of the variational problems 
associated to Es, it is natural to apply variants of the usual arguments from 
the theory of Ginzburg-Landau vortices to the slices 2 x {s} with 0 < s < ô. 
More precisely, we use arguments like those discussed in the previous section 
for the behavior near 02, and arguments from the theory of Bethuel, Brezis, 
and Hélein [BBH93, BBH94] for the behavior in the interior of 2. This is 
the key element in the proofs of each of the results that follow. We omit 
a detailed presentation of these proofs (since similar arguments have been 
discussed earlier), but we give a brief discussion of some additional arguments 
that are needed in each case. For the complete proofs, the reader is referred 
to [Mos03, Mos04, Mos05]. 


Theorem 4.8. For 6 € (0,e~°], suppose m? € C®(N,S?) are stable critical 
points of Es, i.e., solutions of (4.3) satisfying the boundary conditions (4.4) 
and (4.5), such that (4.6) holds for every y € C®(Q5,R°). Suppose further 
that there exists a number K such that 


1 
E5(m°) < 7 log log 5 +K 
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for every 6. Then there exist a sequence dg N 0, two distinct points x!,x? € 
02, and a map m = (m,0) E W'1(2,S! x {0}) with m-v =0 on OQ, such 
that the following holds. 


(i) For any p < 2, the sequence {Ms5,} converges weakly in W1?(Q,IR°) to m. 
The convergence also holds weakly in H'(Q',R?) for every Q! C Q with 
OC ON a? 2}. 

(ti) The limit map m satisfies 


Am + |Vm|?m — Vuo + (m-Vuo) =0 in Q, (4.17) 


where uo is the trace on Q x {0} of the function determined by (4.2). 
(iii) If R? is identified with the complex plane C by z = xı +ix2 (and similarly 
zi=ai+iad and 2? = xz? +ix2 for xt = (xt, x}) and x? = (2?,22)), then 


m has the representation 


ul 2 
Zz may ss i0(z) 


m(z) = 


|z — 21| |z — 2 | 
for a function 0 € C°(Q) which solves 


Ad = poe oF Ou 


— inf. 4.18 
Ox2 me xı j ( ) 


Thus at least at the lowest possible energy level, we observe the develop- 
ment of two boundary vortices in the limit. Note also that the limit equation 
(4.17) is formally the Euler-Lagrange equation for the (formal) functional 


1 1 
z VP de +> f |Vul? dx, 
2 Jo 2 Jre 


where u denotes the function in H+ (R°) defined by (4.2). It turns out, however, 
that this quantity is identically infinite. It can be replaced by a functional 
involving the Dirichlet energy of 0, and then (4.17) truly becomes an Euler- 
Lagrange equation, but we omit the details here. 

The stability condition (4.6) is needed in the proof of this theorem in order 
to estimate the Dirichlet energy of m® in small cylinders of the form 


(B,y5(#) N 2) x (0,6), 
where u > 0 is a fixed constant. This allows to use estimates from the regu- 


larity theory of harmonic maps and to conclude that 


|Vm°| < Ea 


võ 


for a certain constant C which is independent of 6. Apart from the fact that 
such a gradient estimate is normally used in the theory of Ginzburg-Landau 
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vortices, it is in this context also important for another reason: It means that 
mô varies very little in the third direction if 6 is small, for the thickness of 
Ns is small compared with |Vm°|. This permits to work on a suitable slice 
92x {s} and pretend that the domain is two-dimensional for much of the proof. 
The previously mentioned arguments then give (i) and the representation of 
m in (iii). 

To derive the limit equation (4.17), we first take the vector product with 
m? on both sides of (4.3), which gives 


6 div(m? A Vm?) = m A Vu? =0 in Q5. 


This form of the equation has the advantage that it does no longer explicitly 
contain the term |Vm?|?m° (which would be difficult to handle with the weak 
convergence that we have). In particular it is then possible to pass to the limit 
and to show that m satisfies 

Quo Ouo 


div(m,Vmz — m2Vmj,) = mı — 


Agee aN 
gaa Og S 


This equation is exactly (4.18) if m is represented by 6 as in (iii). Finally, the 
equation is also equivalent to (4.17). 


Theorem 4.9. For T € (0,00] and 6 € (0,e7°], suppose m? € C°(5 x 
(0, T),S?) satisfy the Landau-Lifshitz equations (4.11) with boundary condi- 
tions (4.9), (4.10). Also suppose that the initial data 


wh? (x) = m°(x, 0) 
satisfy 
1 
Es(m°) < log log 5 +K 
and 


|Vm’] < 


S/F 


in Ns for a constant K that is independent of ô. Then there exist a sequence 
ôk N 0, a map m = (m,0) € L©([0,T),W*1(2,S' x {0})) with m-v =0 
on ðN x [0,T), and two distinct points x',x? € OQ, such that the following 
holds. 


(i) The sequence {m°*} converges weakly* in L®([0,T),W'?(Q,R°)) to m 
for any p < 2, and also weakly* in L®([0, T), H'(2',R°)) for any Q' C Q 
with R C 2\{a1, a7}. 

(ti) The limit map solves 

om 2 f 

a = Am + |Vm| m — Vuo + (m: Vuo)m in 2 x(0,T), (4.19) 
where uo(-,t) is the trace on 2 x {0} of the function determined by (4.2) 
for almost every fired t € [0, T). 
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(iii) It is of the form 


1 2 
A Z =Z ib (z,t) 


m(z,t) = 





|z — z!| |z — 2?| 


where 0 € C°( x [0,T)) is a solution of 


06 Ouo Ouo 


a— = Ad-—m + moa in N x (0,T). 
Ly 


ot “Axe 
(Here we use an identification of R? with C as in the previous theorem.) 


This is the time scale where we have stationary boundary vortices. The 
limit equation (4.19) is formally the L?-gradient flow for the formal functional 
mentioned earlier (up to a constant). It is also the true gradient flow for a 
related functional. 

It is interesting here to compare the limit equation (4.19) with the original 
Landau-Lifshitz equation, especially if the latter is in the form (4.7). We had 
originally a gyromagnetic term with coefficient 7 and a damping term with 
coefficient â. The gyromagnetic term has vanished in the thin-film limit (as 
it is to be expected when m remains in the plane R? x {0}). We still have a 
damping term, but the damping coefficient is now 


; f 
Lage, 
Q 


Qe] 


Thinking of ¥ as a fixed constant and of â as small in comparison, we are in the 
seemingly paradox situation that decreasing the damping coefficient @ accel- 
erates the dynamics in the thin-film limit. This phenomenon has already been 
discovered by formal computations by W. E and C. Garcia-Cervera [EGO1]. 

With the gradient estimate that we impose on the initial data, we can use 
the same methods as in the proof of Thm. 4.8 to obtain the same development 
of boundary vortices for mm? that we have found for stable critical points. To 
prove this for times t > 0, we need slightly different arguments. Here we 
calculate how the energy develops locally with time, that is, for a function 
n € C§°(R°) with n = 0 in a neighborhood of {x', x7} x [0,6], we calculate 


1 
u (/ n|Vm?|? dx + Fi n|Vu?]? dx) ; (4.20) 
dt \ Jos ô Jgs 


We find that away from the vortex center points x!, x?, the energy increases 
at most by a constant that is independent of ô in bounded time intervals. We 
can then again use arguments from the theory of Ginzburg-Landau vortices 
for fixed times t > 0. 


Theorem 4.10. Under the conditions of Thm. 4.9, but with (4.8) replaced by 
(4.11), there exist a sequence 5; N 0, two curves xt, x? € C°1/?((0,T), 02), 
and a map 
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m = (m,0) € (N L™((0,T),W'?(2,S" x {0})) 


with m -v =0 on ðQ x [0, T), such that the following holds. 


(i) For any p < 2 and any q < œ, the sequence {m} converges weakly in 
L4((0, T), W4?(2,1R?)) to m. 

(ii) For almost every t € [0,T) and every Q! C Q with D' C M {xt (t), x? (t)}, 
the map m(-,t) belongs to H+(Q',S'). 

(iii) The equation 


Am + |Vm|?m — Vuo + (m-Vuo)m =0 in Q x (0,T) 


holds, where uo(-,t) is the trace on 2 x {0} of the function determined 
by (4.2) for almost every fixed t € [0,T). 
(iv) The map m is of the form 


m(z,t) = z-z (t) —— z(t) ei9(zt) 
( , t) lz = z(t] Jz = 20) , 


where 0 € C°(Q x [0,T]) is a solution of 


UO ays te 


^0 = 
nag xı ve Ox 


in Q x (0,T). 

In contrast to the situation of Thm. 4.9, we now have moving boundary 
vortices. This means in particular that when we calculate the evolution of the 
localized energy in (4.20), the vortex centers may enter the support of 7 after 
some time. For this reason, the estimates we obtain are not quite as good as 
before, and the type of convergence we find is weaker. On the other hand, an 
analysis of the energy increase over a fixed time interval permits to estimate 
the distance that the vortex centers have moved in this time (since most of 
the micromagnetic energy is concentrated in the vortex centers). This way we 
obtain the Hélder continuity of zt and x?. 

Finally, comparing Thm. 4.10 with the results of Section 3, especially 
Thm. 3.19, we see that one statement is missing here: We do not have any 
information about the law that governs the motion of the vortices. There is 
no obvious reason why the model discussed here should have a significantly 
different behavior in this respect, but the technical difficulties mentioned ear- 
lier make it hard to carry over the arguments from the simpler model. This 
problem thus remains open. 
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Summary. In order to treat multiple energy- and length-scales in electronic struc- 
ture calculations for extended systems, we have studied a wavelet based multireso- 
lution analysis of electron correlations. Wavelets provide hierarchical basis sets that 
can be locally adapted to the length- and energy-scales of physical phenomena. The 
inherently high dimensional many-body problem can be kept tractable by using the 
sparse grid method for the construction of multivariate wavelets. These so called 
“hyperbolic” wavelets provide sparse representations for correlated wavefunctions 
and can be combined with diagrammatic techniques from quantum many-particle 
theory into a diagrammatic multiresolution analysis. Using sparsity features origi- 
nating from the hierarchical structure and vanishing moments property of wavelet 
bases, this leads to many-particle methods with almost linear computational com- 
plexity for the treatment of electron correlations. 

We are aiming towards applications in semiconductor physics where quasi two- 
dimensional many-particle systems provide challenging computational problems. 
Such kind of systems are metallic slabs and interacting excitons confined in quantum 
wells of semiconductor heterostructures. As a first step we developed a multireso- 
lution Hartree-Fock method suitable for quasi two-dimensional extended systems. 
Special emphasis has been laid on low rank tensor product decompositions of or- 
bitals, which take into account the strongly anisotropic character of these systems 
in one direction. 


1 Introduction 


Quantum many-particle theory provides a general framework for the descrip- 
tion of microscopic processes appearing in solid state physics and chemistry. 
Some of the fundamental difficulties encountered for accurate calculations of 
these processes are inherently related with their multiscale character. The 
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huge diversity of methods and techniques in many-particle theory partly 
emerged from efforts to create tailormade models which describe the essential 
physics at certain energy or length-scales. However, synergetic effects, caused 
by couplings between different scales, often prevent their separate treatment. 
Due to a lack of a priori knowledge of the strength of these couplings, it 
might become necessary to use many-particle models which provide an accu- 
rate description of the system over a whole range of energy- or length-scales. 
In virtue of the multiscale character of such models, it is tempting to study 
possible applications of some recent mathematical developments in the field 
of multiscale analysis. In particular wavelet based multiresolution analysis 
[Dau92, Mal98, Mey92] proved to be a useful tool, both from the analytical 
and computational point of view [Dah97, DeV98]. Wavelets represent sta- 
ble multiscale basis sets that can be locally adapted according to the length- 
and energy-scales of physical processes. Depending on the specific application, 
wavelets can be equiped with a variety of useful properties including compact 
support, (bi)orthogonality and vanishing moments [Dau92]. Sparse approx- 
imations exist for functions containing local singularities [Mal98] and for a 
large class of differential as well as (singular) integral operators [Dah97]. Fur- 
thermore, because of the hierarchical structure of wavelet bases, it becomes 
feasible to apply sparse grid methods [BG04] for high-dimensional problems. 

We have studied possible applications of multiresolution analysis for many- 
electron systems which can be considered as a paradigm for fermionic many- 
particle systems in condensed matter physics. The antisymmetry of fermionic 
wavefunctions introduces, via Pauli’s exclusion principle, a multiscale struc- 
ture into these systems. Their multiscale character expresses itself in the 
energy- and length-scales of physical processes extending over several orders of 
magnitude. Typical examples, ranging from high (short) to low (large) energy- 
(length-) scales, are processes inside atomic cores, covalent chemical bonding, 
van der Waals interactions and magnetic couplings between unpaired electron 
spins in molecules and solids. Besides conventional many-electron systems, we 
want to consider quasiparticles, like electron hole systems, which are of fun- 
damental interest in semiconductor physics. These quasiparticles combine to 
excitonic systems similar to molecules. Of special interest are semiconductor 
heterostructures [Chu95] where excitons are confined in the direction perpen- 
dicular to the semiconductor layers. For nanostructured materials, the diam- 
eter of a free exciton becomes larger than the thickness of a layer. These ex- 
citons, therefore, provide interesting two-scale problems with properties that 
lie somewhere between that of free 2d- and 3d-excitons. 

During the last few years considerable interest in numerical techniques 
from multiresolution analysis has been emerged within the field of elec- 
tronic structure calculations. Most of this work has been done in the con- 
text of density functional theory (DFT) and the Hartree-Fock (HF) method 
[Ari99, Goe98, HF*04]. Recently published papers discuss applications to 
solids [EA02, TBJ03] and possible extensions of multiresolution analysis to 
the density-matrix approach [NTR02] which avoids the cumbersome calcu- 
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lation of eigenvalues and provides linear scaling with respect to the size of 
the system. Another interesting application are semi-classical calculations of 
polarons and bipolarons solvated in a liquid [CF*05]. Wavelets were used for 
the quantum description of the polaron and bipolaron, whereas the solvent 
has been treated classically by an integral equation. 

In DFT, the many-particle problem is mapped onto a system of noninter- 
acting particles [Kohn99], resulting in a significant computational simplifica- 
tion. All many-particle aspects are represented by an approximate exchange- 
correlation potential in the Kohn-Sham (KS) equation. The fundamental prob- 
lem of this approach is that the mapping can be done only approximately and 
systematic ways for improvement are presently still missing. Multiresolution 
analysis can therefore just be applied to multiscale aspects of independent- 
particle systems. For such systems, the most important multiscale features are 
variations of the electron density between atomic core and valence regions in 
molecules and solids. The conventional numerical approach to a simultaneous 
treatment of core and valence electrons in KS or HF equations uses Gaussian 
type orbital (GTO) bases [HJO99], consisting of atomic centered Gaussian 
basis functions with variable exponents, multiplied by harmonic polynomials. 
GTO bases have already reached a high level of sophistication in quantum 
chemistry and seem to be almost optimal for such kind of problems. How- 
ever, in contrast to wavelets, GTO bases are not stable in a mathematically 
rigorous sense and might become ill behaved if very high accuracies have to 
be achieved. There exists a variety of many-particle models in solid state 
physics where GTO bases are not really appropriate due to certain geomet- 
rical constraints imposed on the system. This is the case e.g. for quasi two- 
dimensional many-particle systems already mentioned above, like excitons in 
semiconductor heterostructures. For such kind of systems wavelets seem to be 
an interesting alternative. We proposed a multiresolution HF method for quasi 
two-dimensional systems in Ref. [FH*05b], which will be briefly discussed in 
Sect. 5. 

Beside DFT, there exists another kind of approach where a direct solution 
of the many-particle problem is tried to achieve. Starting from an effective 
noninteracting particle model, like HF, the solution of the interacting many- 
particle system can be approached via configuration interaction (CI) meth- 
ods, many-body perturbation theory (MBPT) or coupled cluster (CC) methods. 
All effects which are not taken into account by the HF solution are summa- 
rized under the phrase of electron correlations. We refer to the monographs 
[Ful93, HJO99] for a thorough discussion of this subject. The original formu- 
lations of these post HF methods rely heavily on the eigenfunctions of the HF 
Hamiltonian. This turns out to be an obstacle for a direct application of mul- 
tiresolution analysis. In order to circumvent this problem, we have adopted 
ideas from local correlation methods, which enable an efficient treatment of 
electron correlations in extended systems. Here we have to mention the local 
ansatz of Stollhoff and Fulde [Ful93, SF80], the increment method of Stoll 
[Sto80] and the local correlation methods of Pulay and Saebø [SP93], which 
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have been further developed into linear scaling methods by Werner and col- 
laborators [SHW99]. Conventional numerical approaches for these methods 
use GTO bases to represent electron correlations. It is common knowledge, 
however, that GTO bases are not well adapted for this purpose [HJO99]. 
Within our approach, GTO bases were replaced by wavelets, which provide 
stable multi-scale bases that can be locally adapted for the description of elec- 
tron correlations. In Sect. 2, we discuss wavelet approximations of correlated 
wavefunctions, cf. Refs. [FH*02, LK*02], and provide some simple examples in 
order to illustrate the underlying concepts. These concepts have been set into 
a broader context in Sect. 4, where we briefly outline various many-particle 
methods and discuss their computational complexity using a diagrammatic 
multiresolution analysis, see Refs. [FH*05a, LK*06] for further details. 

The main part of the paper focus on wavelet methods and the nonrel- 
ativistic many-particle Schrodinger equation. There are however many other 
interesting topics concerning multiscale problems in electronic structure calcu- 
lations. Two such topics related to recent work done by the group in Kassel are 
briefly outlined at the end of the paper. Appendix B summarizes recent devel- 
opments in multi-grid methods for KS and HF equations, whereas Appendix 
C provides a short glimpse into the field of relativistic electronic structure 
theory. 


2 Wavelet approximation of correlated wavefunctions 


The nonrelativistic Schrödinger equation within the Born-Oppenheimer ap- 
proximation provides a firm basis for electronic structure calculations in quan- 
tum chemistry and solid state physics. We are focusing on solutions of the 
stationary Schrodinger equation 


HW (r1,r2,...,rw) = EW (r1,r2,...,rN), (2.1) 


where the Hamiltonian 


N 
i 1 
=) (34.4 Vat) + DOA 
i=1 mt 


includes Coulomb interactions between the electrons and an external poten- 
tial due to the nuclei. Atomic units have been used throughout this paper. 
Recently, interesting results concerning the regularity of the solutions of the 
many-particle Schrédinger equation (2.1) in certain Sobolev spaces of mixed 
derivatives have been obtained by Yserentant [Yse04, Yse05]. These regular- 
ity results open the possibility for a multiscale approximation of the total 
wavefunction WY. Closely related to this approach is the sparse grid method 
of Griebel an coworkers [GG00]. We pursue a more restricted approach by 
studying multiscale aspects within conventional many-particle theories. This 
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has the significant advantage that we can rely on a wealth of experiences in 
physics and chemistry. A natural framework for the representation of many- 
electron wavefunctions ¥ is the product ansatz 


Y (r1,r2,..., rN) = FÐ (r1,r2,..., rN), (2.2) 


where the correlation operator F acts on a mean-field solution ®. In general, 
F has to be understood as a linear operator, who’s specific properties must be 
derived for example from MBPT or CC theory. We have adopted a simplified 
ansatz, where the correlation factor F (r1, r2,...,ry) represents a symmetric 
function of the electron coordinates, which corrects for the inadequacies of 
a given approximate wavefunction ®. Typically, @ is taken from the HF ap- 
proximation of the original Schrédinger equation (2.1). The HF wavefunction 
is given in terms of a Slater determinant which corresponds to an antisym- 
metrized product of orbitals. These orbitals are single-particle wavefunctions 
obtained by solving the HF equation. In general it is not possible to represent 
the exact many-particle wavefunction via a correlation factor F (r1,..., ry). 
We have to deal with a constraint variational problem where the expectation 
value of the energy 


_ f@r,...@ry FÐ (r1,r2,... rN) H FO (r1,4r2,...,rw) 


E\F\| = 
[F] f dri... dry FS (r1,r2,..., rN) FÐ (r1,r2,..., rN) 


i (2.3) 
is minimized only with respect to the correlation factor F instead of the full 
wavefunction YW. The latter would be equivalent to solving the Schrödinger 
equation (2.1). Within our previous work, we have studied wavelet approxima- 
tions of correlation factors for simple atoms [FH*02], exactly solvable many- 
particle models [LK*02] and the homogeneous electron gas [FH*05a]. This 
can be done using either a linear ansatz 


F (r1, r2,..., rN) =>) y 5 IX FP) (r1,r2,.-., rN), (2.4) 
p J A 


or an exponential ansatz, usually called Jastrow factor [Cla79], 


F (ty; Poy. cy rN) = exp 5 5 5 Ph FP (T r2,.. rN) , (2.5) 
p J A 


where symmetrized wavelet tensor products 


PY (r1... ry) = l (r1) + y$ (r2) +-+ (en), (2.6) 


FPA nrn) = 98d, (er) ER (ra) + HE Cna) V2, (wv), 


are formed from isotropic 3d-wavelets ye (r). These wavelets are constructed 


by taking mixed tensor products of univariate wavelets Vj a and scaling func- 
tions Yj a on the same level j 
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y(t) = jae (2) Pj,ay(Y) V,a (2), (2.7) 


p 
I(T) = Vj,ae(2) Pj,ay (Y) Vja (2), 


II 


Ñr) = =i aa (2) Dj,ay(¥) Pjan (2), 


72 (r) = 2 aa (2) Vj,a, (Y) Vja (2). 


For the sake of notational simplicity we have used the notation 71 ) for 3d- 
scaling functions. A brief introduction into univariate wavelets has been given 
in Appendix A. The isotropic 3d-wavelets belong to well defined levels J, 
however, there are seven different types of them, according to the various 
combinations of univariate wavelets and scaling functions. In those cases where 
details concerning type, level and location of wavelets are not relevant, we use 
single Greek wavelet indices (ya) to simplify our notation. Capital Latin and 
Greek multi-indices are used to denote arrays of indices of the same kind like 
in the case of the tensor products (2.6). 

Isotropic wavelet constructions become impracticable beyond three di- 
mensions. For the approximation of correlation factors, we, therefore, have 
to switch to sparse grids [BG04]. This construction became known in the 
wavelet community as hyperbolic wavelets [DKT98]. The concept of hyper- 
bolic wavelets is based on a special kind of hierarchical ordering and truncation 
scheme for standard tensor product wavelets 


yee te Ea) ee rp), (2.8) 


which appear in the symmetrized products (2.6). Due to their anisotropic 
character, we cannot assign a unique level to these tensor products. Instead, we 
have to take their level sums in order to get a hierarchical ordering. The sparse 
grid condition for the multilevel-index J can be expressed as a constraint on 
the shifted sum of wavelet levels 


P 


J| = SG -Ilb +1) <Q, (2.9) 


i=l 


where p is the number of 3d-wavelets in the tensor products (2.8) and ji 
are their corresponding levels. According to relation (2.9), the sparse grid 
threshold parameter Q determines the possible combinations of 3d-wavelet 
levels that are taken into account in the tensor product expansions (2.4) and 
(2.5), starting at the coarsest level lo which appears in the isotropic 3d-wavelet 
basis. As a consequence of the sparse grid condition, the number of p-electron 
tensor products (2.6) is of O(M log(M)?~+) with respect to the cardinality 
M ~ O(23@) of the underlying 3d-wavelet basis. For small p, the growth is 
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almost linear in M, however, it still increases exponentially with respect to p. 
As a consequence we can achieve only modest values of p in the expansions 
(2.4) and (2.5). A prime on the sum with respect to the multilevel-index 
J indicates that only those tensor products are taken into account, which 
either satisfy the sparse grid condition (2.9) or belong to a specific adaptive 
refinement. 

Recently, best N-term approximation theory [DeV98] has been extended 
by Nitsche to hyperbolic wavelets [Nit03]. Taking into account the asymptotic 
behaviour of Jastrow factors near coalescence points of electrons it is possi- 
ble to derive an estimate for the approximation error of the pair-correlation 
function in the H! Sobolev space [FHS06a] 


inf — FO —S PP FP m SN, (2.10) 
HLFD ISN D 


where #{F e y denotes the number of two-electron tensor products (2.6) in 
the expansion (2.5). In best N-term approximations these tensor products 
have to be chosen in order to minimize the approximation error in H+. The 
estimate (2.10) is sharp for all sufficiently regular wavelets with a minimum 
number of vanishing moments. In the case of hyperbolic 6d-wavelets based 
on isotropic 3d-wavelets ya, this estimate requires at least three vanishing 
moments for the underlying univariate wavelet ~. For comparison, a direct 
construction of hyperbolic 6d-wavelets from a univariate wavelet basis re- 
quires only two vanishing moments for the same estimate. It is essentially the 
inter-electron cusp at coalescence points of electrons that restricts the order 
of approximation. The hyperbolic tensor products from isotropic 3d-wavelets 
are especially convenient for an adaptive refinement in the neighbourhood 
of a cusp [FH*02, FH*05a]. Numerical studies for an exactly solvable many- 
particle model [LK*02] and the homogeneous electron gas [FH*05a] demon- 
strate that a hyperbolic tensor product construction according to the sparse 
grid criteria (2.9) with additional diagonal refinement turns out to be very 
close to best N-term approximation. 


3 Variational calculations for the linear ansatz 


The Rayleigh-Ritz variational principal can be applied without further ap- 
proximations to the expectation value of the energy (2.3) for the linear expan- 
sion of the correlation factor (2.4). It yields a generalized eigenvalue problem 
of the form 

Hf=EMEf, (3.1) 


for the variational parameters f4, with matrix elements 


Hoa = f Pri... drn Fob H Fad (3.2) 
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Maa= |ë r,...,@ry Fob Fy®. (3.3) 


This direct approach avoids any uncontrolled approximations. We want to 
mention however, that in order to develop methods suitable for large scale 
applications it becomes necessary to switch from the linear to the exponential 
ansatz (2.5) in order to keep the computational complexity within reasonable 
bounds. This topic is further discussed in Sect. 3.2. 

Before we start to review some applications of the linear ansatz, a few 
remarks are in order concerning the computation of the matrix elements (3.2) 
and (3.3). It turns out that this step is the bottleneck of our approach and 
requires a careful analysis concerning the computational complexity. Due to 
the tensor product ansatz for the correlation factor F, the Coulomb interaction 
part of the matrix elements Hy, factors into the standard one-electron 


(nal) = fer, nee (3.4) 


and two-electron 


1 1 
(malin) = f rdr nalts) 


gee) B85) 


Coulomb integrals. These integrals have to be calculated for various products 
of orbitals ¢; and isotropic 3d-wavelets 


di(r) oj (r) 
nalr) = 4 oi(r)¢;(t)Ya(r) 
pildi) lr) (r). 


The factorization of matrix elements (3.2) and (3.3) into lower dimensional 
integrals is already a complex task for systems containing more than two 
particles. We have to rely on diagrammatic techniques from many-particle 
theory in order to derive a recurrence scheme for the evaluation of these matrix 
elements from the basic integrals (3.4) and (3.5). In Sect. 4.1, we present a 
brief outline of the recurrence scheme and of the underlying diagrammatic 
techniques. 

A new numerical feature arises from the products of wavelets and orbitals 
which appear in the integrals (3.4) and (3.5). Estimates for wavelet products, 
based on their Sobolev regularity, have been derived in Ref. [FH*02], indicat- 
ing that a sufficiently high regularity of the wavelet basis is required for an 
efficient numerical treatment of these integrals. In an intermediate step, we 
first calculate the auxiliary function 


1 
R(rı) = [an (eae np (r2) 


by performing a matrix times vector multiplication in the nonstandard rep- 
resentation [BCR91, Bey92] of the Coulomb interaction and product function 
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np. For wavelets with several vanishing moments, the Coulomb interaction 
matrix elements decay very fast with increasing distance from the diagonal. If 
wavelets appear in the product function 7p, it has compact support and the 
corresponding wavelet expansion is strongly peaked around the wavelet be- 
longing to the finest scale in the product. The auxiliary function R therefore 
has a sparse representation in the dual wavelet basis 


Rr) & 5 Ra alr). 


In order to calculate the two-electron Coulomb integrals (3.5) it remains to 
perform a scalar product with the product function 7a 


1 ~a 
(na | — | np) ~ Sina | ¥)Re 


a 


Both vectors involved in the scalar product have sparse representations in the 
wavelet basis. 


3.1 Results for the helium atom and a many-particle model 


We have demonstrated the feasibility of our method by applying it to the 
helium atom and isoelectronic Lit ion. Actually this is a standard model for 
electron correlations in quantum chemistry, which shows many characteristic 
features of large systems. From a technical point of view, it already required 
the full machinery for calculating one- and two-electron integrals within the 
wavelet basis. For our calculations we have taken the biorthogonal SDD6 
wavelets with six vanishing moments [Swe96]. Further details concerning the 
wavelet basis are given in Appendix A. Our results obtained so far are only of 
preliminary character, due to the fact that the isotropic 3d-wavelet basis ya 
was only crudely adapted to the size of the atoms. For full technical details 
we refer to Ref. [FH*02]. Nevertheless our wavelet approximations recovered 
98% and 97% of the correlation energy * for He and Lit, respectively. The 
correlation energies for various levels of refinement are shown in Fig. 3.1. 
As a reference, the wavelet level j = 0 corresponds to a grid separation of 
1 bohr. In the case of the He atom, the dominant contribution comes from 
the level j = —2, which already recovered 83% of the correlation energy. 
Compared with it, the contribution of the next coarser level 7 = —3 is almost 
negligible (< 0.1 mhartree). At finer scales, contributions to the correlation 
energy decrease quite fast. A definite statement concerning the asymptotic 
convergence behaviour cannot yet be drawn from our calculations. According 
to the estimate (2.10), we expect an asymptotic convergence of O(2~°/) where 
we assumed diagonal dominance due to the inter-electron cusp. Our results 
show that rather accurate energies can be achieved already with a rather 


4 The correlation energy corresponds to the difference between exact and HF energy 
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Fig. 3.1. Correlation energies of He and Li* from variational and JPT2 calculations 
for various levels of refinement. The level j = 0 corresponds to a resolution length 
of 1 bohr. Calculations have been performed with the SDD6 wavelet. 


small number of wavelets. This is encouraging in view of the fact that we have 
used regular Cartesian grids, which were not especially adapted to spherical 
symmetry, except that we have placed the nucleus at the origin. 

Exactly solvable many-particle models provide an interesting playground 
for method development. Especially convenient for our purposes was a one 
dimensional bosonic model suggested by Koprucki and Wagner [KW00]. The 
Hamiltonian of this model is given by 


N N 
1 1 1 
A model ae -3 > ðr? + 2 X a? + X (5e: = xj) p z% ra zl) ’ (3.6) 


i=1 i<j 


which corresponds to a system of coupled harmonic oscillators with repulsive 
contact and long-range interactions. The bosonic ground state wavefunctions 
of this model 


N 
1 1 
Wo(x1,%2,.--,0N) =N] [ep she - z; Lee |-333| ; 


i<j 


have a simple product structure with a cusp similar to electrons. We have 
studied the convergence behaviour of various approximation schemes [LK*02] 
in L°, L? and H! norms and for the energy. Because of the simplicity of 
the model, we were able to consider fully adaptive wavelet grids where tensor 


Multiscale Electronic Structure Calculations 309 














3 T T 2 3 
Ga ja) 
Ẹ& @ (-2,-2) 
Vv (-2,-1) 
E | È E s | E (-2,0) 
a fi A (-1,-1) 
+ © (-1,0) 
1+ A o A o 4 40,0) 
Sed * (0,1) 
° * Vs + (1,1) 
* Vv Gt V v (1,2) 
x OF ee J 0 (2,2) 
v + vV x 
* A * © 
ot y 
-1+ > A > A 4 
+ 
=» A 
25+ e E e J 
> 
zg L L 1 fi L 
-3 -2 -1 0 1 2 3 


Fig. 3.2. Fully adaptive wavelet grid for the two-particle model (3.6), based on the 
contribution of tensor products Yj; a1 (£1) Wj2,a9 (£2) to the energy. 


products have been selected according to their contribution to the energy. 
These grids, depicted in Fig. 3.2 for the two-particle case, show pronounced 
diagonal dominance for wavelets on fine levels. It turned out that hyperbolic 
wavelets with appropriate diagonal refinement at the cusps have comparable 
computational complexity as a fully adaptive treatment based on the energy 
contribution of individual wavelet tensor products. For both schemes, the 
energy shows an asymptotic convergence rate of O(2~4) with respect to the 
level of refinement. This roughly corresponds to O(N~!) convergence with 
respect to number of tensor product wavelets. 


3.2 Size-consistency error for the linear ansatz 


The linear hyperbolic wavelet expansion of the correlation factor (2.4) enables 
a strictly variational treatment of Schrodinger’s equation via the generalized 
eigenvalue problem (3.1). For typical applications to extended systems, size- 
consistency ° is required, which means that expectation values of operators 


5 In the terminology of quantum chemistry this property is often called size- 
extensivity. 
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Fig. 3.3. Size consistency error per atom AE = Ey.,,/N — Eve (mhartree) of 
hyperbolic wavelet and SDCI expansions for systems of noninteracting helium atoms. 
The wavelet approximation for the correlation factor of a single helium atom has 
been taken from Ref. [FH*02] (Eye = -2.89904 hartree, two levels j = —2,—1). Two 
different sparse grid constraints Q = 4 (o) and Q = 6 (*) have been imposed to the 
product wavefunction of the helium clusters. For comparison SDCI calculations (0) 
have been performed with a s,p VQZ basis set (Eye = -2.90015 hartree). 


that correspond to extensive thermodynamic properties, like the energy, are 
proportional to the system size. Within the linear ansatz, size-consistency 
is hard to achieve. Not only an inflationary number of variational parame- 
ters, but even more severe, an increasing complexity of the matrix elements 
(3.2) and (3.3) makes the linear hyperbolic wavelet expansion to a challeng- 
ing problem for large systems. It has been already demonstrated by one of 
us [Hac01], that in order to keep the computational complexity under control 
an additional constraint has to be imposed. Besides the sparse grid condition, 
we have to truncate the first sum in the expansion (2.4) at p < pmax < N, 
where pmax may slightly vary with the size of the system. This resembles to 
truncation schemes for the CI method in quantum chemistry, where typically 
only single and double excitations (SDCI) are taken into account. 

In order to illustrate this problem, we have studied the size-consistency 
error of hyperbolic wavelet expansions for a standard benchmark problem in 
quantum chemistry, namely systems of N noninteracting helium atoms. Such 
kind of model recovers essential physical aspects of the problem [HJO99]. 
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Due to Pauli’s principle, molecules can roughly be described by interacting 
electron pairs. The dominant intra-pair interactions are taken into account by 
this model, whereas inter-pair correlation effects are excluded. The latter are 
more subtle and not easily accessible to analysis. For the sake of computational 
simplicity, we consider a standard wavelet tensor product expansion for a 
single helium atom 


Wye (r1, r2) -È 3 > {Pet FP (r1, r2)| (r1) (r2), (3.7) 


where the sparse grid condition has not been applied. The corresponding wave- 
function for a cluster of N noninteracting helium atoms is given by the product 


N 
Wien (rı ais Ton) = J [7 (r2i—1, Lai) 7 (3.8) 


i=1 


A special permutational symmetry with respect to an interchange of variables 
among different helium atoms is not required due to the underlying assump- 
tion of a large spatial separation of the atoms. Obviously, the total energy 
scales linearly with the number of helium atoms Eye, = N Epe. After some 
reordering of the product wavefunction (3.8), the wavelet tensor products for 
the cluster can be arranged like in the linear ansatz (2.4) 


Pik (r1...an) = IEA (r2i—1;r2:), with p= Se pe. 


Applying the hyperbolic wavelet approximation to the reordered wavefunction 


Wey (T1-.-L2Nn) = |X 5 DA T Ton) Io T2i—1) P(rai), 


P \J|<Q A 


only those tensor products Fy are taken into account for which the sparse 


grid condition (2.9) is satisfied. On each truncation level Q, the coefficients 


re have been reoptimized with respect to the energy. In order to limit the 


computational effort, we have contracted the atomic wavelet coefficients on 
each of the tensor product levels J. This did not introduce any noticeable 
error into our calculations. In Fig. 3.3 we have shown the size consistency 
error per atom AF = Eye, /N — Eye in the case of a two-scale wavelet 
expansion for a single helium atom. We refer to Ref. [FH*02] for further details 
concerning this wavefunction. For both thresholds Q =4,6,a single helium 
atom can afford all tensor products in the wavefunction (3.7), however, for 
clusters of several helium atoms, restrictions with respect to the full product 
wavefunction (3.8) are imposed. The hyperbolic wavelet approximation is less 
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stringent than SDCI, in a sense that certain multicenter tensor products are 
still possible. For illustration we compare in Fig. 3.3 with SDCI calculations 
in a s,p valence quadruple zeta (VQZ) basis set. It can be seen that the size 
consistency error is much smaller already for Q = 4 compared to the SDCI 
method. Increasing the sparse grid parameter to Q = 6 results in a very 
small size consistency error, which seems to be acceptable for systems with 
up to twenty electrons. In principle it appears to be possible, to adjust the 
parameter Q to the system size in order to achieve a given accuracy. However 
in view of the difficulties concerning the evaluation of matrix elements, this 
becomes impracticable beyond a certain size of the system. For comparison, 
the exponential Jastrow ansatz (2.5) preserves the product structure of the 
wavefunction (3.8) if only one- and two-particle tensor products (2.6) are taken 
into account. 


4 Jastrow perturbation theory and the local ansatz 


A rigorous treatment of the exponential ansatz (2.5), based on Rayleigh-Ritz’s 
variational principle, is almost unfeasible due to the highly nonlinear character 
of the variational problem. Instead it is possible to derive various approxima- 
tion schemes which preserve the size-consistency of the exponential ansatz. 
For Jastrow-type wavefunctions, such kind of methods are the local ansatz of 
Stollhoff and Fulde [Ful93, SF80] and Krotscheck’s Fermi hypernetted chain 
(FHNC) method [CKP92, Kro85]. Without additional symmetries, the non- 
linear system of FHNC equations is very challenging from a computational 
point of view and we refrain from a numerical treatment within multiresolu- 
tion analysis. We have adapted, however, the diagrammatic representation of 
matrix elements, which we have slightly modified by introducing additional 
elements from multiresolution analysis. Instead of the nonlinear FHNC equa- 
tions, we have taken the local ansatz as a guideline for our work. The local 
ansatz can be considered as a power series expansion of the exponential ansatz 
[SF80] which leads to a linear system of equations 








SO ((Fh H Fa) — (Fh Fa) (H)) fa = (Fh Hi) (4.1) 
A 


where the correlation energy is given by the expression 


Ecor = X (Hı Fa) fa. (4.2) 


A 


To simplify our notation we have introduced a short-hand notation (---) := 
(®yr|---|Gur) for expectation values with respect to the HF wavefunction. 
The residual interaction Hı := H — Hgcp is defined as the difference between 
the exact and HF Hamiltonian. Furthermore, we have assumed (F4) = 0, 


Multiscale Electronic Structure Calculations 313 


which can be imposed in a natural way using the formalism of second quan- 
tization discussed below. The considerable simplification due to linearization 
is at the expense of an accurate treatment of long-range correlations. For our 
envisaged applications in quantum chemistry and solid state physics, where 
electrons or excitons are confined by an external potential, this is perfectly jus- 
tified. Whereas long-range correlations become important for metallic systems 
[Ful93] where they cause a divergence in the energy for finite order perturba- 
tion expansions. Guided by the local ansatz, we have studied a perturbative 
approach for Jastrow factors [LK*06]. Such kind of approach seems to be nat- 
ural from a physical point of view, however, to the best of our knowledge, 
it has not been described in the literature. This is likely due to the failure 
of finite order perturbation theory for most of the standard applications of 
Jastrow factors in condensed matter physics, where long-range correlations 
are important. 

From a formal point of view, Jastrow perturbation theory (JPT) is closely 
related to coupled cluster perturbation theory (CCPT) presented in [HJO99]. 
The symmetrized tensor products (2.6) are conveniently expressed in terms of 
second quantization, which introduces additional flexibility into the perturba- 
tion analysis. This can be used to remove redundancies from the many-particle 
basis and to reduce computational complexity which is essential for practical 
applications. In order to illustrate our assertions, we consider an arbitrary 
two-particle basis function as an operator in second quantization 


A 1 
Fo = Fe), 5 S (pg FO st) ch chercs, (4.3) 
past 
with 
(oul Fst) = f | Parda byl: by (22) (tx) 79(#2) bel )O1(%) 
+a — p 


where x; := (ri, ci) denotes the combined spatial and spin coordinate of a par- 
ticle. The creation ch and annihilation c, operators refer to the single particle 
orbitals ġa of Hscr. In the following, we denote virtual orbitals by a, },---, 
occupied orbitals by 7,7,--- and arbitrary orbitals by p,q,---. We want to 
mention that virtual orbitals {ġa} are introduced for purely formal reasons 
and that for actual calculations only occupied orbitals {¢;} are required. This 
is due to the fact that the underlying tensor product basis (2.6) consists of 
simple functions which allows us to use the identity 


> ba(X1)b9(X2) = ô(r1 — r2) fo ,o2 > bi(x1)o (4.4) 


we refer to Ref. [FH*05a] for further details. An obvious choice, in the spirit 
of CC theory, for simplified two-particle operators (4.3) are cluster-type op- 
erators 
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FED = 5S (ad FOr ia)chchejes, (4.5) 
abij 

which preserve commutativity of the many-particle basis. This property turns 
out to be essential for the exponential perturbation scheme. On a first glance, 
the second quantized Jastrow ansatz (4.5) resembles closely to CC theory. 
There is however an essential difference inasmuch as the underlying function 
basis (2.6) does not guaranty convergence of the product ansatz (2.2) to the 
exact wavefunction in the complete basis set limit. For that reason we take 
the variational energy (2.3) as the starting point for our perturbation analysis 
instead of the CC energy and projection equations. The latter assume an exact 
ansatz for the wavefunction and provide the basis for the CCPT approach 
[HJO99]. 

We have performed a perturbation analysis for the linear and exponential 
representation of the correlation factor [LK*06]. For cluster-type operators, 
we obtain in the linear case a natural truncation scheme with respect to 
the degree p of the many-particle basis F (P) that preserves size-consistency 
for the energy. It requires e.g. only two-particle operators at second-order 
but four-particle operators already for third-order energies. In order to avoid 
such an increase of computational complexity, we switched to the exponential 
Jastrow-type ansatz, where size-consistency can be achieved by restricting to 
two-particle operators at all orders of perturbation theory. The first order 
approximation to the correlation operator Ê = DD Fa fa, given by 


S (Fh Hscr Fa) — (FL F4)(Hscr)) faa =-(FbM), (46) 
A 


yields the second order (JPT2) correlation energy 


Ey = XCA Fa) faa- 


A 


By comparison with the local ansatz (4.1) and (4.2), we observe a close rela- 
tionship between both approaches. Actually it turns out that the local ansatz 
is almost equivalent to third-order perturbation theory. Concerning compu- 
tational complexity, the JPT2 method is considerably simpler than the local 
ansatz. The Hamiltonian on the left hand side of Eq. (4.6) contains only one- 
particle operators in contrast to the local ansatz (4.1) where the full Hamil- 
tonian appears. 

In order to demonstrate the accuracy of the JPT2 method, we compare 
in Fig. 3.1 the correlation energies for He and Lit, at various levels of refine- 
ment, with those obtained from variational calculations. It can be seen that 
JMP2 overestimates the correlation energy by ~ 10% for these two-electron 
systems. This has to be compared with standard second order Mgller-Plesset 
perturbation theory which underestimates the correlation energy for He by 
a similar amount. Another property of interest is the behaviour of the wave- 
function near the inter-electron cusp. We have plotted in Fig. 4.1 the angular 
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Fig. 4.1. Angular dependence of the correlation function FY+4F) of the He atom 
at various levels of refinement. The wavelet level j = 0 corresponds to a resolution 
length of 1 bohr. Both electrons are at a distance of 0.6 bohr with respect to the 
nucleus. Calculations have been performed with the SDD6 wavelet. For comparison, 
the corresponding correlation function obtained from a highly accurate Hylleraas CI 
calculation is shown. 


dependence of the correlation function F® + F?) of the He atom at various 
levels of refinement. Both electrons were kept fixed at a distance of 0.6 bohr 
with respect to the nucleus. The accuracy of the wavelet approximations can 
be judged by comparing them with a correlation function obtained from a 
very accurate Hylleraas CI calculation [FHM84]. Despite of the rather crude 
approximation, our finest level corresponds to a resolution length of 0.5 bohr, 
the JMP2 correlation function provides a fairly accurate description of the 
correlation hole. 


4.1 Diagrammatic multiresolution analysis 


Straightforward application of diagrammatic techniques from many-particle 
theory [LM86] enables a formal evaluation of the matrix elements which ap- 
pear in the local ansatz (4.1) or JPT2 method (4.6). The resulting Goldstone 
diagrams can be further transformed into FHNC diagrams [CKP92] using the 
identity (4.4). We have studied the computational complexity for the evalua- 
tion of these diagrams in detail for the local ansatz [FH*05a]. An important 
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Fig. 4.2. Diagrammatic elements of multiresolution analysis. 


aspect of this analysis is the multiscale structure of the underlying wavelet 
basis. This leads to a diagrammatic multiresolution analysis where diagrams 
can be classified according to their contribution on specific energy- and length- 
scales. The diagrammatic elements of our multiresolution analysis, depicted 
in Fig. 4.2, have been adopted from FHNC theory [CKP92]. By a slight abuse 
of our notation (2.6), we have introduced the pair-correlation basis functions 


T (r1,r2) = Ya (r1) Ya(r2) + Ya(r2) ya(r1)- (4.7) 


Each vertex in a diagram corresponds to an integral f d?r. Spinless HF density 
matrices p(r1, r2) are represented in terms of spatial orbitals 


plr, r2) = X. mi ilr) oi(r2)*, 


i 


where n; is the occupation number of the 7’th orbital. 

Because of the tensor product structure of pair-correlation basis functions 
(4.7), it becomes possible to build up all the diagrams in a recurrent manner 
from a few types of basic diagrams. At the onset of our recurrence scheme we 
consider those parts of the diagrams which are directly linked to the Coulomb 
interaction. These basic Coulomb diagrams, shown in Fig. 4.3, determine the 
computational complexity of our approach. Various types of pointwise prod- 
ucts and wavelet tensor products appear in the basic Coulomb diagrams. The 
cardinalities of the corresponding sets with respect to the size M = O(23@) 
of the underlying isotropic 3d-wavelet basis (2.7) are listed in Table 4.1. In 
order to derive these cardinality estimates, we have taken into account the 
compact support and hierarchical structure of the 3d-wavelets as well as the 
sparse grid condition (2.9) for hyperbolic tensor products. 

The number of basic Coulomb diagrams increases almost quadratically 
with respect to the isotropic 3d-wavelet basis. In order to achieve further re- 
ductions to almost linear computational complexity, we considered the effects 
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Fig. 4.3. Basic Coulomb diagrams of the recurrence scheme. 


Table 4.1. Cardinalities of certain sets of pointwise and tensor products for isotropic 
3d-wavelets which appear in the diagrammatic multiresolution analysis. 


Wavelet (tensor) product Diagrammatic element Cardinality 


Ya(r) 0 M 
Yo(r) ya(r) ” O(M log(M)) 
Fap (t1,r2) EOS O(M log(M)) 
Yalrı) Fup (r1, r2) Q HHEH o(M?’) 
Fag (£1,£2) Foy (r2, r3) ME ESVE O(M? log(M)) 
A Fy ia pee eeeee O(M?) 


of the vanishing moment property of wavelets on various sets of diagrams. 
This requires further assumptions concerning the smoothness of the density 
matrix. Obviously, nuclear cusps or a highly oscillating behaviour of the den- 
sity matrix within the support of a wavelet spoils this useful property. If such 
kind of behaviour is restricted to atomic core regions, which correspond to a 
comparatively small portion of the total volume, it can be handled by adap- 
tive local refinements of the 3d-wavelet basis. For our initial studies we have 
neglected such kind of complications and essentially considered valence elec- 
tron systems. A prominent example is the homogeneous electron gas, where 
oscillations are bounded by the Fermi momentum. This led us to a simplified 
model for the sets of basic type 3, 5 and 6 diagrams 
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Fig. 4.4. a) Model integrals I aE «,k’) for the basic type 3, 5, and 6 diagrams. 
Absolute values of the integrals I” (a, K, 6) are plotted on a logarithmic scale 
versus the distance |a|. The plane wave parameters were chosen to be isotropic 
k1 = k2 = «3 with values K; = 2 (o). Isotropic 3d-wavelets and scaling functions 
were generated from the univariate SDD6 wavelet basis. 


1, k, k) = fan [ar e en 2) (rı) err ee af (r2), 

(4.8) 
where only wavelets and the Coulomb interaction are left. All effects of the 
remainder of the diagrams on the vertices are represented by plane waves, 
through which we modulated the oscillations by varying the momenta k, K’. 
The presence of oscillatory functions in the integrand has important conse- 
quences for its large distance behaviour i.e. |a| — oo. Our numerical studies 
for the model integrals (4.8), shown in Fig. 4.4, demonstrate that the van- 
ishing moment property only provides additional sparsity if 2~/|«;| is smaller 
than a certain critical value. Such kind of sparsity can be related to the be- 
haviour of the derivatives of the Fourier transform of the mother wavelet near 
the origin. 

Pointwise wavelet products on the vertices of diagrams also destroy the 
vanishing moment property. This partially affects the sets of basic type 5 and 
6 diagrams and is most severe for the sets of basic type 7 and 8 diagrams, where 
wavelet products appear on both vertices of the Coulomb interaction. Since 
the cardinalities of these sets of diagrams increase quadratically with the size 
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Fig. 4.5. Selected diagrams for numerical studies with FHNC//0 pair-correlation 
function for a homogeneous electron gas. 


of the 3d-wavelet basis it is essential to achieve additional sparsity through the 
vanishing moment property. A possible resolution to this problem are wavelet 
expansions of pointwise wavelet products on the vertices of the diagrams. This 
results in an almost linear complexity for all of the basic Coulomb diagrams. 
Similar arguments apply to the remaining parts of the recurrence scheme. 

We have studied the convergence behaviour of wavelet expansions for se- 
lected diagrams, shown in Fig. 4.5, in the case of a homogeneous electron 
gas for which “optimal” pair-correlation functions Fe. (11,r2) have been 
obtained from FHNC//0 calculations [KKQ85]. For our calculations we have 
chosen an electron density of rs = 2.07 (xp = 0.927 bohr~'). This corresponds 
to the average valence electron density in aluminium. The cubic supercell with 
edge length 12 bohr contains 54 electrons. The FHNC//0 pair-correlation 
function was expanded in terms of hyperbolic wavelet tensor products with 
additional diagonal refinement 


FEkol (r1, r2) > faFo (r1, r2). 


The approximation error for the diagrams D1 to D4 provides some insight 
into the performance of our approach for realistic problems. It can be seen 
from Fig. 4.6 that the overall convergence of these diagrams is rather similar. 
Fairly high accuracies have been already achieved at rather coarse levels. This 
is due to relatively weak couplings between pair-correlation functions and the 
Coulomb potential in the diagrams D1 to D4. 


5 Multiresolution Hartree-Fock for quasi 
two-dimensional extended systems 


Our further development of diagrammatic multiresolution analysis is aim- 
ing towards applications in the field of semiconductor physics, especially for 
strongly anisotropic systems like semiconductor heterostructures. A common 
feature of our envisaged applications is that they are closely related to molec- 
ular systems. Traditional methods in quantum chemistry based on Gaussian 
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Fig. 4.6. Convergence of selected diagrams for hyperbolic wavelet expansions with 
diagonal refinement of a FHNC//0 pair-correlation function in the case of a homo- 
geneous electron gas at rs = 2.07. Isotropic 3d-wavelets have been obtained from 
the univariate SDD6 wavelet basis. Translational symmetry has been only taken into 
account with respect to the coarsest wavelet level Jo = —2 in the expansion. 


type basis functions, however, cannot be easily applied due to the pronounced 
two-scale character of these systems. In contrast to this, wavelet bases can 
be adapted in a straightforward manner to highly anisotropic problems. As a 
preparatory step for the implementation of the local ansatz or JPT methods, 
we have developed a multiresolution HF method for quasi two-dimensional 
extended many-particle systems [FH*05b]. Most prominent examples are jel- 
lium models for metallic surfaces and slabs. Within these models, the posi- 
tively charged atomic cores are approximated by a constant background charge 
density [NP01]. Such kind of models are homogeneous in two directions and 
show a strongly inhomogeneous behaviour in the third direction, which is 
perpendicular to the surface of the slab. Due to the homogeneity of jellium 
models in two dimensions, HF orbitals 


On (t) = Ga (x) err (5.1) 


factor into products of two dimensional plane waves and functions Ç% (a) that 
describe the behaviour of the orbitals perpendicular to the surface. Inside the 
slab these functions behave like plane waves and decay exponentially at the 
surface. Amazingly there has been no exact HF calculations for jellium sur- 
faces or slabs reported in the literature. This is due to the nonlocal exchange 
operator present in the HF equations, which makes the perpendicular part 


Multiscale Electronic Structure Calculations 321 


of the orbitals ¢*(x) depending on the momenta parallel to the slab. How- 
ever, approximate HF calculations, following the pioneering work of Bardeen 
[Bar36], have been published [SKG77, SM80]. Within these calculations, the 
nonlocal exchange operator was approximated by a local potential. As a conse- 
quence, the perpendicular part ¢**(x) does no longer depend on the momenta 
in the parallel directions, which greatly simplifies the numerical treatment. 
The same argument applies to DFT, where only local potentials appear in 
the KS equation. 

Starting with Lang and Kohn’s seminal paper [LK71], jellium surfaces 
have been extensively studied for various kinds of density functionals [HJ74, 
LP75, PEO1]. They proved to be of considerable significance for the develop- 
ment of new density functionals. Therefore a strong need for accurate many- 
particle benchmark calculations exists. However, due to the presence of peri- 
odic boundary conditions and the absence of localized orbitals, these systems 
pose great technical difficulties for traditional quantum many-particle meth- 
ods. There are essentially only two such methods which can deal with jellium 
surfaces, quantum Monte Carlo (QMC) [AC96, LN*92] and the FHNC method 
[KKQ85]. It exists an ongoing controversy concerning the accuracy of these 
calculations [NP01]. Significant deviations with respect to results obtained 
from DFT calculations exist and presently the reason for these discrepancies 
has not been settled. 

In order to solve the HF equation with periodic boundary conditions, we 
have chosen a supercell approach, similar to the setting used for QMC calcu- 
lations [AC96, LN*92]. This seems to be an appropriate choice with respect to 
intended extensions to many-particle methods like the local ansatz. A further 
significant advantage of this approach is that it can handle impurities, which 
becomes important for applications where electrons or excitons are confined in 
quantum dots embedded into a semiconductor heterostructure. The supercell 
approach is based on the standard many-particle Schrödinger equation (2.1), 
where the Coulomb interaction between electrons has been replaced by the 
Ewald potential which takes into account the Coulomb interaction between 
electrons inside the supercell as well as with all their images in periodic copies 
of the supercell. Due to the quasi two-dimensional character of the systems, 
we expect that the coupling between the perpendicular and parallel directions 
in the Hamiltonian remains rather weak. The orbitals are therefore expressed 
as linear combinations 


of (r) TD DAER Si ») (5.2) 


of tensor product basis functions 


K 
Eile) = be (x) G;""(y, 2), 

where univariate supercell wavelets Y$! are taken in the perpendicular direc- 

tion and contracted isotropic 2d-supercell wavelets ae l in the parallel direc- 
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tions 


Pl (y = Lidar U y,z (5.3) 


For a detailed presentation of our approach we refer to Ref. [FH*05b]. In view 
of Eq. (5.1) for the jellium case, it is reasonable to assume that the Kronecker 
rank K of the tensor product decomposition for HF orbitals (5.2) can be 
kept small and almost independent of the size of the supercell through an 
appropriate choice of the contracted basis functions (5.3). A possible choice 
for these basis functions are HF orbitals of the corresponding two dimensional 
homogeneous system or systematic basis functions like plane waves. 
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Appendices 


A Basic notions of multiresolution analysis 


The purpose of this appendix is to provide some basic definitions and proper- 
ties of wavelets which are required for this paper. For a complete exposition 
of this subject, we refer to the excellent monographs of Daubechies [Dau92], 
Meyer [Mey92] and Mallat [Mal98]. 

In one dimension, multiresolution analysis provides a partition of the 
Hilbert space L?(R) into an infinite sequence of ascending subspaces --+ C 
Vj-1 C Vj C Vj41 C +++, where the index j runs over all integers. The union 
of these subspaces |J, V; is dense in L?(R). On each subspace Vj, the scaling 
function y(x) generates a basis 





~j,a(L) := 25/2 (a — a), (A.1) 


via the operations of dilation and translation. The dilation factor 2) scales 
the size of the basis functions, which means that with increasing j, the ~j,4 
provide a finer resolution in L*(IR). An explicit embedding of V; into the larger 
space V;+1 is given by the refinement relation 


pla) = 2Y ha p(22 — a), 


where the number of nonzero filter coefficients ha is finite for the scaling func- 
tions used in our applications. Wavelet spaces W; are defined as complements 
of Vj in Vj41. The corresponding wavelet basis is generated from a mother 
wavelet ¢(x) analogous to Eq. (A.1) 


Pj alt) := 29/2q(2a — a). 
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This construction leads to a hierarchical decomposition of L?(R) = @, jez Wi 
into wavelet subspaces W; [Dau92]. In a biorthogonal wavelet basis there exists 
a sequence of dual spaces V;, W;, which satisfy the orthogonality relations 
W; L Vj and V; 1 W;. The corresponding dual wavelets Via: — 25/24) (QW a— a) 
and scaling functions j,q := 2//2¢(2/a — a) generate a biorthogonal basis in 
L*(R) . 
(Y5,a|23,b) = a,b, (Wi altib) = 95,1 Ja,b- 

An arbitrary function in L?(R) can be expanded in a biorthogonal wavelet 

basis 


f(x) = X (bual f) ) Plo,a(x ESD ) Vjal), (A.2) 


a j=lo a 


Vio Doss W: 








where the scaling function and wavelet coefficients are given by scalar prod- 
ucts with respect to the dual basis. The multiscale approximation of smooth 
functions reveals an important sparsity feature due to the vanishing moments 
property of wavelets. Depending on the specific choice of the wavelet basis, a 
certain number of moments vanish 


[axs* ia) =0, for k=0,...,n—1. 


This property has a significant effect on the magnitude of wavelet coefficients, 
as can be seen from local Taylor series expansions 


f(a) = co + + cnal — 24a)” * + Ra-a(z) (a2 — 27a)”, (A.3) 


around the centers of wavelets Üj ala). Inserting the Taylor series expansions 
(A.3) into the scalar products yields the following estimate for the wavelet 


coefficients 
| fen dx f £) Djal alt) 


SUD sig a} En- (0)| 2770+12 fo 


II 


lbs alf)| 





x" p(x), 








where the supremum of the remainder |R,_1(«)| has to be taken with respect 
to the support of the wavelet Yj ale). For functions with rapidly converging 
local Taylor series, the corresponding wavelet expansions (A.2), therefore, 
converge very fast with respect to the level j. This leads to sparse wavelet 
representations for these functions. 

For our numerical studies, we have used the univariate symmetric biorthog- 
onal wavelets with six vanishing moments of Sweldens [Swe96] and the corre- 
sponding univariate scaling function of Deslauriers and Dubuc [DD89]. In the 
text, we refer to this basis as the SDD6 wavelet basis. 
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B Multi-grid methods for Kohn-Sham and Hartree-Fock 
equations 


An alternative approach to correlated many-electron systems is the density 
functional method [Kohn99]. There, multiscale aspects arise from the use of 
multi-grid (MG) methods for solving the discretized KS equation and from the 
different spatial scales defined by the physics of the KS orbitals, in particular 
if heavy atoms are involved. The spatial extension of core and valence orbitals 
may differ by more than two orders of magnitude. This has to be compared 
with the high accuracy requirements of quantum chemical calculations. A 
relative accuracy of 1078 t° -9 in total energies is required in oder to obtain 
three significant digits for the binding energy of a molecule with heavy atoms. 
From these considerations it is obvious that measures have to be taken in 
order to keep the computational effort under control. 

The finite element method (FEM) can be used in order to discretize the 
non-relativistic energy density functional 


x : 1 
B= X ( lilo) -3 f otrvulear (B.1 
=F S oe WVereln)er + Enuc, 


where the one-particle Hamiltonian 
a 1 
h= -3â i VH F Vese A Vauc, 


contains the Hartree potential Vy, the exchange-correlation potential Vezc, 
and the electron-nucleus external potential Vpuc. Actually, ES, = ES lp] can 
be considered as a functional of the single particle density p = >; |¢:|?. Varia- 
tion of the functional (B.1) with respect to the orbitals ¢;, under conservation 


of their norms, leads to the KS eigenvalue equation 
hdi(r) = eidi(r). (B.2) 


The KS equation was first solved, using a combination of FEM and MG, 
for diatomic molecules which enable a reduction to two dimensional prob- 
lems due to rotational symmetry. For a better analytic adaptation to the 
Coulomb singularities, two successive transformations of the coordinates have 
been performed. Namely first to elliptic hyperbolic coordinates and then by 
a further transformation to intrinsic coordinates (see e.g. Ref. [KK01]) where 
the ansatz functions are polynomials and the hierarchical grids are thus easily 
established. In cartesian coordinates this corresponds to complicated transcen- 
dental approximating functions on rotated triangular domains with curved 
boundaries. The MG method allowed a fast and comprehensive investigation 
of many different types of density functionals (LDA, GGA and the CS orbital 
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functional) for dimers of the first long period chemical elements ranging from 
Hez to Fo. 

Further extensions to a non-linear MG scheme for solids were considered 
in Ref. [WW*05], where pseudo potentials have been used to get rid of the 
core electrons. Therefore it was possible to treat the KS equations in three 
dimensions on a finite difference grid. The non-linear MG proved to be quite 
favorable in order to cut down the number of updates of the potentials. On 
the other hand it has already been found in 2d molecular calculations, which 
used a linear MG, that the self-consistent-field iterations could be favorably 
coupled with the MG iterations and thus the gain by a non-linear MG would 
not have been dramatic. 

Another application of MG methods concerns the HF equation [BHK06] 
which differs from Eq. (B.2) in such a way that the single particle Hamiltonian 
is now orbital dependent via the non-local exchange potential which replaces 
the local Vere of the density functional. Otherwise things work rather simi- 
larly, but of course the non-local HF equation requires a considerably more 
complicated data handling in a MG scheme. The convergence properties are 
better than in density functional calculations, because the HF exchange po- 
tential does not have the p!/? behaviour which is non-analytic for p — 0 which 
happens exponentially for r — oo. 


C Relativistic aspects of electronic structure calculations 


The non-relativistic treatment of many-electron systems misses essential 
physics the heavier the atoms become. Therefore an extension to a relativistic 
density functional description was done and pursued in the traditional 4-spinor 
approach using FEM, after a relativistic linear combination of atomic orbitals 
(LCAO) method had already been used since the early 80ties. Many problems 
arose, unknown from non- relativistic approaches; they were finally traced to 
the fact that the relativistic problem always has as solutions not only scatter- 
ing states in the positive continuum of the spectrum and the discrete bound 
states, but also positronic states in the negative continuum below -mc?. 

The Dirac equation, which properly describes relativistic physics, can be 
solved rather optimal for atoms on a logarithmic mesh ranging from some 
small radius rp to some rmaz, big enough not to cut off too much from the 
exponential tails of the solutions. Usually, the innermost region from r=0 to 
ro is treated analytically with a Taylor series approximation plus singular 
factors, in order to account for the singular behaviour for point nuclei. The 
wavefunction with total angular momentum j = 1/2 (generally for j = (2n + 
1)/2 its nth derivative) diverges with non-integer powers at nuclei, unlike 
the non-relativistic case where only cusps appear. Alternatively, nuclei can be 
treated as extended objects which, however, leads to other kinds of difficulties 
concerning the regularity of the wavefunction at the boundary of the nucleus. 
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If more than one atom is involved (molecules, cluster, solids, plasmas, etc.) 
one is faced with the problem of different scales defined by different atoms. 
In a stationary problem this may be handled by different scales at the known 
nuclear sites which can be done, but requires high effort. However in a non- 
stationary case these centers are moving and this has to be accounted for by 
a changing map of scales. This is a rather hopeless task except that physics 
allows a separation into localized and non-localized contributions where the 
localized terms have to be moved with the nuclei whereas for the non-localized 
parts a simple multiscale treatment operating on a fixed multi-level space with 
time evolving expansion coefficients suffices. Such a scheme consists of the 
combination of analytical manipulations with fully numerical ones, which in a 
rather simple two atomic case, using the Dirac-Fock Slater density functional, 
has already been realized in a “stationary description”, see Ref. [DHK98] for 
further details. 

The Dirac equation results from a variation of the relativistic energy func- 
tional under the contraint of norm conservation of the wavefunction. Unfor- 
tunately there exists no simple way to separate minimum from stationary 
solutions. In fact they always mix to some extend and these admixtures may 
be identified as spurious contributions of the negative (positronic) contin- 
uum to the electronic states one is interested in. This used to be an almost 
unsurmountable problem for approximate solutions of the Dirac equation, ex- 
cept for atoms where the asymptotic analytical behaviour of bound states is 
known. For molecules, the recently developed relativistic minimax functionals 
[ES99, DESOO, ZKK04, ZK*05, KKR04, ZKK06] provide a loophole from this 
dilemma. It was a big step ahead that one could prove the boundedness of the 
minimax functional from below for any sufficiently smooth square integrable 
(i.e. normalizable) function [DE*00] and thus may obtain the best solution in 
an approximation space by minimization. 

For the construction of the eigenvalues of an operator H which has a 
gap (here —mc? till +mc?) in its continuous spectrum and is unbounded from 
above and below, the minimax principle can be expressed through the Raleigh 
quotient . 

Àk = inf sup io Lat |e) Eca ae 
gemG $70 (¢ | 4) 
G subspace of F}  g¢(G@F_) 
where F} © F_ is an orthogonal decomposition of a well-chosen space of 
smooth square integrable functions. It has been proven in Refs. [ES99, DESOO, 
DE*00, DES03] that the sequence of minimax energies Ax equals the sequence 
of eigenvalues of Å in the interval (—mc?,-+mc?). There is however a price to 
pay, namely the equations become non-linear in the eigenvalues. This causes 
a severe completeness problem of the spectrum, as there is usually not known 
a priori how many solutions are below a given energy. In the worst case one 
has to construct one solution after the other starting from the lowest energy 
solution. This at least can be formulated in a stable and absolutely safe way 
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(ZK K04]. However it turned out to be possible to derive a very accurate linear 
approximation (LARM) [ZK*05] by doubling the variational parameter space. 
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Summary. This review is concerned with three classes of quantum semiconductor 
equations: Schrödinger models, Wigner models, and fluid-type models. For each 
of these classes, some phenomena on various time and length scales are presented 
and the connections between micro-scale and macro-scale models are explained. We 
discuss Schrédinger-Poisson systems for the simulation of quantum waveguides and 
illustrate the importance of using open boundary conditions. We present Wigner- 
based semiconductor models and sketch their mathematical analysis. In particular 
we discuss the Wigner-Poisson-Focker-Planck system, which is the starting point 
of deriving subsequently the viscous quantum hydrodynamic model. Furthermore, a 
unified approach to derive macroscopic quantum equations is presented. Two classes 
of models are derived from a Wigner equation with elastic and inelastic collisions: 
quantum hydrodynamic equations and their variants, as well as quantum diffusion 
models. 


1 Introduction 


The modern computer and telecommunication industry relies heavily on the 
use of semiconductor devices. A very important fact of the success of these 
devices is that their size can be very small compared to previous electronic 
devices (like the tube transistor). While the characteristic length of the first 
semiconductor device (a germanium transistor) built by Bardeen, Brattain, 
and Shockley in 1947 was 20um, the characteristic size has been decreased up 
to now to some deca-nanometers only. With decreasing device length quan- 
tum mechanical effects are becoming more and more important in actual 
devices. In fact, there are devices, for instance tunneling diodes or quantum 
wave guides, whose function is based on quantum effects. The development 
of such devices is usually supported by computer simulations to optimize the 
desired operating features. Now, in order to perform the numerical simula- 
tions, mathematical equations are needed that are both physically accurate 
and numerically solvable with low computational cost. 
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We wish to model the flow of electrons in a semiconductor crystal with the 
goal to predict macroscopically measurable quantities by means of computer 
simulations. Although the physical process is always the transport of charged 
particles in a solid crystal, we need to devise different mathematical models 
because of the wide range of operating conditions and the desired need of ac- 
curacy. Moreover, since in some cases we are not interested in all the available 
physical information, we also need simpler models which help to reduce the 
computational cost in the numerical simulations. 

We shall discuss three model classes: Schrödinger, Wigner, and fluid-type 
models. Schrödinger models describe the purely ballistic transport of electrons 
and holes, and they are employed for simulations of quantum waveguides and 
nano-scale semiconductor heterostructures. As soon as scattering mechanisms 
(between electrons and phonons, e.g.) become important, one has to resort 
to Wigner function or the equivalent density matrix formalism. For practical 
applications Wigner functions have the advantage to allow for a rather simple, 
intuitive formulation of boundary conditions at device contacts or interfaces. 
On the other hand, the Wigner equation is posed in a high dimensional phase 
space which makes its numerical solution extremely costly. As a compromise, 
fluid-type models can provide a reasonable approximation and is hence often 
used. Since one only computes the measurable physical quantities in these 
fluid models, they are computationable cheap. Moreover, classical boundary 
conditions can also be employed here. 

The multi-scale character in semiconductor device modeling becomes man- 
ifest in a hierarchy of models that differ in mathematical and numerical com- 
plexity and incorporate physical phenomena on various time and length scales. 
The microscopic models clearly include the highest amount of information, but 
they involve the highly oscilatory Schrodinger and Wigner functions. How- 
ever, the macroscopic variables of interest for practitioners (like particle and 
current densities) are typically much smoother. Hence, it is very attractive 
(particularly with respect to reduce numerical costs) to settle for simplified 
macroscopic quantum transport models. Scaling limits allow to relate these 
models and to obtain important information for the range of validity (and 
the limitations) of the reduced macro-scale models. Starting from Wigner- 
Boltzmann-type equations it is indeed possible to obtain a unified derivation 
of quantum hydrodynamic and quantum diffusion models. 


2 Microscopic picture I: Schrödinger models 


This section is concerned with Schrddinger-Poisson models for semiconductor 
device simulations. Such models are only applicable in the ballistic regime, 
i.e. to devices or subregions of devices, where the quantum mechanical trans- 
port is the dominant phenomenon and scattering plays only a minor role. 
As particular examples we name interferences in quantum waveguides, the 
tunneling through nano-scale semiconductor heterostructures (in a resonant 
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tunneling diode, e.g.), and the ballistic transport along nano-size channels of 
MOSFETs. In all of these examples we are dealing with open quantum sys- 
tems, refering to a model on a finite geometry along with open boundaries (this 
contrasts with the situation in Sect. 3.1, where we shall consider collisional 
open quantum systems). Here, the transport model (the Schrödinger equa- 
tion, e.g.) is posed on a finite domain 2 C R? (d being the space dimension). 
At the device contacts or interfaces open boundary conditions are specified, 
such that an incoming current can be prescribed and outgoing electron waves 
will not be reflected at such boundaries. 


2.1 Quantum waveguide simulations 


In this subsection we discuss simulation models for quantum waveguides. 
These are novel electronic switches of nanoscale dimensions. They are made of 
several different layers of semiconductor materials such that the electron flow 
is confined to small channels or waveguides. Due to their sandwiched structure 
the relevant geometry for the electron current is essentially two dimensional. 
Figure 2.1 shows the example of a T-shaped quantum interference transistor. 
The actual structure can be realized as an etched layer of GaAs between two 
layers of doped AlGaAs (cf. [Ram02]). Applying an external potential at the 
gate (i.e. above the shaded portion of the stub, the “allowed region” for the 
electrons, and hence the geometry (in particular the stub length) can be mod- 
ified. This allows to control the current flow through such an electronic device. 
It makes it a switch, which resambles a transistor — but on a nano-scale. Such 
a device shows sharp peaks in conductance that are due to the presence of 
bound states in the stub (see Fig. 2.2, 2.3). It is expected that these novel 
devices will operate at low power and high speed. 

The electron transport through a quantum waveguide can be modeled 
in good approximation by a two dimensional, time dependent Schrödinger- 
Poisson system for the wave functions 7) (x, t), indexed by the energy variable 
A € ACR. The (possibly time-dependent) spatial domain Q C R? consists 
of (very long) leads and the active switching region (e.g. T-shaped as in Fig. 
2.1). In typical applications electrons are continuously fed into the leads as 
plane waves y$”. The Schrödinger model now reads 


Opn 


1 


The potential V = Ve + V; consists of an external, applied potential Ve and the 
selfconsistent potential satisfying the Poisson equation with Dirichlet bound- 
ary conditions: 
-AV;(z,t) = nlet) = f aD PIOA seer e3 
A 
V;=0, on dQ. 
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Fig. 2.1. T-shaped geometry 2 C R? of a quantum interference transistor with 
source and drain contacts to the left and right of the channel. Applying a gate 
voltage above the stub allows to modify the stub length from Lı to L2 and hence 
to switch the transistor between the on- and off-states. 


Here, n is the spatial electron density and g(A) denotes the prescribed statis- 
tics of the injected waves (Fermi-Dirac, e.g.). In the simplest case (i.e. a 1D 
approximation) open or “transparent” boundary conditions at the contacts or 
interfaces take the form 


Eu a vy”) TE —e7 7/4/84 (ha vr”), AE A, (2.3) 


where 7 denotes the unit outward normal vector at each interface. /0; is 
the fractional time derivative of order E, and it can be rewritten as a time- 
convolution of the boundary data with the kernel t~3/?. For the derivation of 
the 2D-variant of such transparent boundary conditions and the mathematical 
analysis of this coupled model (2.1)-(2.3) we refer to [BMP05, Ar01]. 

The discretization of such a model poses several big numerical challenges, 
both for stationary and for transient simulations: Firstly, the wave function is 
highly oscillatory for larger energies, while the macroscopic variables of inter- 
est (particle density n and potential V) are rather smooth. Secondly, solutions 
to (2.1)-(2.3) can exhibit sharp peaks in the curve of conductance versus injec- 
tion energy (both in quantum waveguides and in resonant tunneling diodes). 
To cope with these two problems, WKB-type discretization schemes for the 
1D stationary analogue of the above model and adaptive energy grids (for 
À € A) were devised in [BP06]. 
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Fig. 2.2. Stationary Schrodinger wave function for T-shaped waveguide with short 
stub (i.e. Lı = 32 nm) — “off state” 


Thirdly, the numerical discretization of the transparent boundary condi- 
tion (2.3) is very delicate in the time dependent case, as it may easily ren- 
der the initial-boundary value problem unstable and introduce hugh spuri- 
ous wave reflections. Based on a Crank-Nicolson finite difference discretiza- 
tion of the Schrödinger equation, unconditionally stable discrete transparent 
boundary conditions were developed in [Arn98] for the one-dimensional and in 
[AES03, AS06] for the two-dimensional problem. An extension of such discrete 
open boundary conditions for (multiband) kp-Schrédinger equations appear- 
ing in the simulation of quantum heterostructures were developed in [ZA*05]. 

To close this subsection we present some first simulations of the electron 
flow through the T-shaped waveguide from Fig. 2.1 with the dimensions X = 
60 nm, Y; = 20 nm. These calculations are based on the linear Schrodinger 
equation for one wave function with V = 0 and the injection of a mono- 
energetic plane wave with 130 meV, entering in the transparent boundary 
condition (2.3). The simulation was based on a compact forth order finite 
difference scheme (“Numerov scheme”) and a Crank-Nicolson discretization 
in time [ASO6]. 

Important device data for practitioners are the current-voltage (I-V) char- 
acteristics, the ratio between the on- and the (residual) off-current as well as 
the switching time between these two stationary states. Depending on the size 
and shape of the stub, the electron current is either reflected (off-state of the 
device, see Fig. 2.2) or it can flow through the device (on-state, see Fig. 2.3). 
Starting from the stationary state in Fig. 2.2, the swiching of the device was 
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Fig. 2.3. Stationary Schrödinger wave function for T-shaped waveguide with long 
stub (i.e. L2 = 40.5 nm) — “on state” 


realized by an instantaneous extension of the stub length from Lı = 32 nm to 
Lə = 40.5 nm. After a transient phase the new steady state Fig. 2.3 is reached 
after about 4 ps. 


3 Microscopic picture II: Wigner models 


In this section we shall present and discuss semiconductor models that are 
based on the quantum-kinetic Wigner formalism. As mentioned before there 
are two main reasons for using this framework in applications (indeed, mostly 
for time dependent problems): In contrast to Schrödinger models the Wigner 
picture allows to include the modeling of scattering phenomena in the form 
of a quantum Boltzmann equation. Secondly, this quantum-kinetic framework 
makes is easier to formulate reasonable boundary conditions at the device 
contacts, using guidance and inspiration from classical kinetic theory [Fre87]. 
This approach makes indeed sense, as quantum effects are not important close 
to the (typically highly doped) contact regions. 

The Wigner function f = f(x, v,t) is one of several equivalent formalisms 
to describe the (mixed) state of a physical quantum system (cf. [Wig32]). It 
is a real-valued quasi-distribution function in the position-momentum (zx, p) 
phase space at time t. In collision-free regimes, the quantum equivalent of the 
Liouville equation of classical kinetic theory governs the time evolution of f. 
In the d-dimensional whole space it reads 
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ftp: Vaf +0V]f = 0, t>0, (2,p) ER”, (3.1) 


where the (real-valued) potential V = V(x) enters through the pseudo- 
differential operator 6[V] defined by 


OVINE Dt) = Aare f AV Fontem te” dn 


l 


= J SV (z,n) f(x, p, t) P=) dp dn. (3.2) 
(27)? Jra Jra 
Here, 6V(x,n) := + [V(x + $) — V(x — $)], and Fp—n f denotes the Fourier 
transform of f with respect to p, and € > 0 is the scaled Planck constant. 
For semiconductor device simulations it is crucial to include the self- 
consistent potential. The electrostatic potential V (x, t) is hence time-dependent 
and obtained as a solution to the Poisson equation 


A,V(a,t) = n(x, t) — C(x), (3.3) 


with the particle density 
nt) = | flep t) dp. (3.4) 
R 


Here, C(x) denotes the time-independent doping profile of the device, i.e. the 
spatial density of the doping ions implanted into the semiconductor crystal. 


3.1 Wigner models for open quantum systems 


Until now we only considered the ballistic and hence reversible quantum trans- 
port of the electrons in a one-particle (Hartree) approximation. Such purely 
ballistic models (either in the Wigner or Schrödinger framework - cf. Sect. 2) 
are useful semiconductor models for device lengths in the order of the elec- 
trons’ mean free path. For larger devices, however, scattering phenomena be- 
tween electrons and phonons (i.e. thermal vibrations of the crystal lattice) or 
among the electrons have to be taken into account. In this case an appropri- 
ate collision operator Q(f) has to be added as a right hand side of (3.1). In 
contrast to Sect. 2 the term open quantum system refers here to the interac- 
tion of our considered electron ensemble with some ‘environment’ (an external 
phonon bath, e.g.) and not to the influence of the boundaries or contacts. 
For the classical semiconductor Boltzmann equation excellent models for 
the most important collisional mechanisms have been derived (cf. [MRS90]) 
and are incorporated into today’s commercial simulation tools. In quantum 
kinetic theory, however, realistic and numerically usable collison models are 
much less developed. In contrast to classical kinetic theory, quantum collision 
operators are actually non-local in time (cf. the Levinson equation [Lev70] as 
one possible model). However, since most of the current numerical simulations 
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involve only local in time approximations, we shall confine our discussion to 
such collision operators. The three most used models are firstly relaxation 
time approximations of the form 
fo- f 

Q2; (3.5) 
with some appropriate steady state fo and the relaxation time 7 > 0 [KK*89, 
Arn95a]. Secondly, many applications (cf. [Str86, GG*93, RA05]) use quantum 
Fokker-Planck models: 


Q(f) = Gdivp(pf) + cA, f + 2ydivp(V2f) + aAcf, (3.6) 


(cf. [CL83, CE*00] for a derivation) with the friction parameter 3 > 0. The 
non-negative coefficients a, y, g constitute the phase-space diffusion matrix of 
the system. We remark that one would have a = y = 0 in the Fokker-Planck 
equation of classical mechanics [Ris84]. 

As a third option, the Wigner equation (3.1) is often augmented by a semi- 
classical Boltzmann operator [KN05]. However, since this model is quantum 
mechanically not consistent, we shall not discuss it any further. Finally, we 
mention the quantum-BGK type models [DR03] that were recently introduced 
for deriving quantum hydrodynamics (cf. Sect. 4.2 for details). 

At the end of this section we briefly list the numerical methods developed 
so far for Wigner function-based device simulations. Virtually all simulations 
were carried out for one dimensional resonant tunneling diodes. The earli- 
est approaches were based on finite difference schemes for the relaxation-time 
Wigner-Poisson system [Fre87, KK*89]. Spectral collocation methods were de- 
signed as an efficient alternative to discretize the non-local pseudo-differential 
operator @[V] (cf. [Rin90]). In [AR95] this approach was combined with an 
operator splitting between the transport term p- Vy and [V] which has 
also been common practice for Boltzmann type equations. This mixed op- 
erator splitting/spectral collocation technique was recently extended to the 
Wigner-Fokker-Planck system in [DA06]. In [KN05] the classical Monte Carlo 
method was extended to Wigner models, and it has the potential to make 
multi-dimensional simulations feasible. Since the Wigner function takes both 
positive and negative values, novel algorithms for particle creation and anni- 
hilation had to be developed within this Monte Carlo approach. 


3.2 Open quantum systems in Lindblad form 


Since the Wigner function takes also negative values, it is a-priori not clear 
why the macroscopic particle density satisfies n(a,t) := f f(x,p,t)dp > 
0 Yz, t. This physically important non-negativity is a consequence of the non- 
negativity of the density matrix (operator) that is associated with a Wigner 
function: Let p be an operator on L?(R¢) with integral kernel p(x, x’) = p € 
L?(R?4), ice. 
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(POE) = | Pære) v6 € PRÀ. (3.7) 
Rd 


The Wigner transform of the density matrix p is now defined as the following 
Wigner function f (cf. [Wig32, LP93]): 


WAE = Her) = Goa f olet gma Fn)" dn, (88) 


In terms of density matrices, a mixed quantum states is described as a 
positive trace class operator on L?(R%) (i.e. P € Zi, P > 0), mostly with the 
normalization Trp = 1, where Tr denotes the operator trace. The positivity 
of p as an operator then implies pointwise positivity of the particle density 


n(x) := p(x,2)>0, x €R%, (3.9) 


and of the corresponding kinetic energy 


2 2 
Eum = 5 f, IPP f(ep) dedp = ETA) = 1 V=Apv=A) > 0. 
R24 


2 
(3.10) 
The time evolution of a density matrix is given by the Heisenberg-von 
Neumann equation, obtained by applying the Wigner transform (3.7), (3.8) 
to the Wigner equation (3.1). It reads 


icp, = H(t) — PH (t), t>0, 
Alt = 0) = pr, a 
with the (possibly time dependent) Hamiltonian H(t) = -ŞA +V (x,t). For 
an open quantum system the right hand side of (3.11) has to be augmented 
by a non-Hamiltonian term iA(p). It is well known from [Lin76] that such 
quantum evolution equations preserve the positivity of p(t) (more precisely, 
it is actually the complete positivity) if and only if the dissipative term A(p) 
satisfies the following structural condition. It must be possible to represent it 
in the so-called Lindblad form: 


A) = X LS Lip + pL Lj — 2L;pL*, (3.12) 
jEJ 


with some appropriate (but typically not uniquely defined) Lindblad operators 
Lj, and a finite or infinite index set J C N. Furthermore, such models then 
preserve the mass of the system, i.e. Tr p(t) = Tr Pr, t > 0. 

For the relaxation time Wigner equation we have A(p) = Ê one with some 
steady state fo. Under the natural assumption Tr pr = Tr fo = 1, and if the 
relaxation time 7 is constant, we have (cf. [Arn95b}) 





~N 1 ~ a ~ A * =~ aT * aT * 
ACP) = = (60 Trp — P Tr Po) = XO Li, Lin + PL,L jn — 2LjnPL*,,, 
j,keN 
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with the Lindblad operators Lj, = \/ur/T |k >< yzl. Here, (uk, Ye) ren 
denotes the eigenpairs of po. Hence, the relaxation time Wigner equation 
(possibly with a selfconsistent potential) is an admissible open quantum model 
in Lindblad form. 

For the Wigner-Fokker-Planck (WFP) equation with Q(f) from (3.6), the 
Lindblad condition (3.12) holds iff 


a y+Ëß 
Gee a ) > 0. (3.13) 


on 


Under this assumption the WFP model is also quantum mechanically cor- 
rect. Here, the Lindblad operators are linear combinations of zj and ôs; 
(cf. [AL*04]). 


3.3 Analysis of the Wigner-Poisson-Fokker-Planck system 


In this section we will sketch the different approaches to the well-posedness 
analysis for the Cauchy problem of the Wigner-Poisson-Fokker-Planck (WPFP) 
system in three dimensions: 


Of +p-Vaef + OV] f = Bdivy(pf) + coApf + 27divp(Vaf) +oAcf, t > 0, 
f(ept=0) = fr(z,p), (ap) € Rf, (3.14) 


AsV (x,t) = n(x, t) = ie w(x, p, t) dp. 


For simplicity we set here all physical constants equal to 1, and we chose 
C = 0 as this would not change the subsequent analysis. Also, the Lindblad 
condition (3.13) is assumed to hold in the sequel. 

First we remark that the WPFP model cannot be writen as an equiva- 
lent system of countably many Schrodinger equations coupled to the Poisson 
equation (and this is typical for open quantum systems). Therefore, the ap- 
proach of [BM91] employed in the well-posedness analysis of the (reversible) 
Wigner-Poisson system cannot be adapted to WPFP. Hence, there are two re- 
maining frameworks for the analysis: the Wigner function and density matrix 
formalisms, which we shall both briefly discuss here. 

On the quantum kinetic level there are two main analytic difficulties for 
the nonlinear WPFP system. Since the natural analytic setting for Wigner 
functions is f(.,.,¢) € L?(R6) we cannot expect uy f(z, t) € a holds. 
Hence, the definition of the particle density by n(x, t) = fall z, p, t) dp is 
purely formal. The second key problem is the lack of F a-priori estimates 
on the Wigner function which would be needed to prove global-in-time exis- 
tence of WPFP-solutions: The only useful (and almost trivial) estimate is 


FOl) < eF*|Ifrl|z2qesy, t> 0. (3.15) 


The other physically obvious conservation laws 
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l f(a, p,t)dzdp = const (mass conservation), 


and a simple energy balance involving the kinetic energy Erin = 4 f |p|? 
f(x, p,t) dzdp both include functionals of f that are, a-priori, not necessarily 
positive and hence not useable on the quantum kinetic level. 


Dispersive effects in quantum kinetic equations. Both of the described 
analytic problems — proper definition of the particle density n (or, equivalently, 
the electric field E = V, V) and additional a-priori estimates — can be coped 
with by exploiting dispersive effects of the free-streaming operator jointly with 
the parabolic regularization of the Fokker-Planck term. Such dispersive tech- 
niques for kinetic equations were first developed for the Vlasov-Poisson system 
(cf. [Per96]) and then adapted to the Vlasov-Poisson-Fokker-Planck equation 
n [Cas98]. In [ADM04, ADMO5] these tools were extended to quantum ki- 
netic theory. They yield first of all an a-priori estimate for the field E(t) in 
terms of || f(t)||z2(rR6) only (remember (3.15) !). This estimate allows a novel 
definition of the macroscopic quantities (namely, the self-consistent field, the 
potential, and the density), which, in contrast to the definition (3.3), (3.4) is 
now non-local in time. This way, no p-integrability of f is needed. 

Next we illustrate these dispersive tools in some more detail. With G(t) = 
G(a,p,2z’,p’,t) denoting the Green’s function of the linear part of (3.14) 
(cf. [SC*02]), the (linear) WFP equation can be rewritten as 


Teni j= ffewn EE dp! (3.16) 


+ [ff eovowine.r.t— sardas, £>0. 


According to the two terms on the r.h.s. we split the electric field 


E(a,t) = VV (x,t) = I aut E « n(x, t) 
into E = Eo + Fy with 
Che are J | GON pdz dp! dp, (3.17) 
Ar|] 


mene T i. f ' / | | G(s)(6[V]f)(2',p',t — 8) dx'dp'dpds. 


With some tricky reformulation this last equation can be rewritten as 
Ve t) (3.18) 


1 Z 3zj£k — djx|0l = Sjrlal? x 
= xz F] x,t, s) ds, 
me eis 0) ae 
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j =1,2,3; with 
Fr[f](a, t, 8) = f (DE + BN) (x — 0(s)p,p,t—s)dp, k=1,2,3, 


N (a) = (27)? exp (-££) 


— eB 
O(t) = A ot) := t, if B=0, 
—Bt _ 9-28 Z —B = 
R(t) =2at +o (£ t ae 2) +a (£ ai *), 


and the (vector valued) pseudo-differential operator T [E] is related to [V] by 


Notice that (3.18) is a closed equation (more precisely a linear Volterra inte- 
gral equation of the second kind) for the self-consistent electric field E, € R3, 
for any given Wigner trajectory f € C((0, T]; L?(R6)). 


These motivations lead to our new definition of the Hartree-potential: 
Definition 3.1 (New definition of mean-field quantities) To a Wigner 
trajectory f € C([0, T]; L7(IR°)) we associate 


e the field E|f] := Eo + Eı[f], with Eo given by (3.17), and E;[f] the unique 
solution of (3.18), 
e the potential V[f] := Vo + Vi[f] with 














Volx, t) := D BE (Eo), (x,t), (3.19) 
Vilf](z, t) = tye BE (Elf FD: (x,t), (3.20) 
e and the position density n| f] := divE|f] (at least in a distributional sense). 


In contrast to the standard definitions (3.3), (3.4), these new definitions 
are non-local in time. Also, the map f ++ V[f] is now non-linear. For a given 
Wigner trajectory these two definitions clearly differ in general. However, they 
coincide if f is the solution of the WPFP system. These new definitions of the 
self-consistent field and potential have the advantage that they only require 
f € C((0, 7); L7(R®°)) and not f(z,.,t) € L'(R3). If f(t = 0) only lies in 
L? (R), the corresponding field and the potential will consequently only be 
defined for t > 0. 

The equation (3.18) now easily yields the announced || E(t)||,2(gs) estimate 
for t € (0,00) in terms of fr and ||f(t)||z2s) only. The fixed-point map 
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f — V[f] > f (where the last steps refers to solving the linear WFP equa- 
tion (3.16) with given V(t), t > 0) is now contractive in C((0, T]; L7(IR®°)) and 
it yields the global mild solution for the WPFP system (3.14). 


Without going into details we briefly list alternative kinetic approaches 
for the WPFP system that were developed in the last few years: In [ADM04] 
p-weighted L?-spaces were used to make the definition of the particle den- 
sity by n = f fdp meaningful. In [AL*04, CLN04], instead, an L'-setting 
is chosen with the same motivation. The pseudo-differential operator [V] is 
rewritten there as a convolution operator in the p-variable. We remark that 
such kinetic strategies are valuable as they can, possibly, be extended to the 
WPFP boundary value problems used for semiconductor device modeling. 


The quantum Fokker-Planck system for density matrices. Using the 
Wigner transforms we first rewrite the WPFP system (3.14) for the integral 
kernel p(x, x’, t) from (3.7): 


pt = —t1Hyp + tH p — Pie — x’): (Vz — Vx’)P (3.21) 





+ a| Vz + Val? p — ola — x'|?p + 2iy(x — 2") - (Va + Ver)p, 


coupled to the Poisson equation for V, where H, is a copy of the Hamiltonian 
H =H, =—4$A,+V(z,t), but acting on the «’—variable. The corresponding 
density matrix p then satisfies the evolution equation (3.11), augmented with 
ar.h.s. 7A(p) in Lindblad form (3.12) and coupled to the Poisson equation. 

For the whole space case the density matrix formalism provides the most 
elegant analytic setup. Motivated by the kinetic energy Exin(?) defined in 
(3.10) we define the “energy space” 


E= {p Tı | v1 Apv 1 A Ty}. 


For physical quantum states (i.e. P > 0) we then have 





lâle = Tr p+ Exin(). 


The simple estimate 
IInIlz2¢@R) < Alz 


gives a rigorous meaning (in L!) to the definition of the particle density (3.9), 
and the Lieb-Thirring-type estimate (cf. [Arn95a, LP93]) yields 


In ||z3¢R3) < Cllplle- 


Therefore the nonlinearity [V (x,t) —V(2’, t)] p(x, 2’, t) in (3.21) is locally Lip- 
schitz in €. Since the linear part of (3.21) generates a mass conserving semi- 
group on €, standard semigroup theory yields a unique local-in-time solu- 
tion to (3.21). A-priori estimates on the mass Tr (p) and the total energy 
Etot(P) = Exin(P) + $||VaV [A]|l7Z2, due to the energy balance 
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d A 

gto = 30 Tr pr — 2p Ekin (t) — al|n(t)|IZ2, 

then shows that there exists a unique global mild solution of (3.21) in 
C([0, 00); €) (ef. [AS04]). 


4 Macroscopic picture: fluid-type models 


The aim of this section is to derive macroscopic quantum models from the 
following Wigner-Boltzmann equation for the distribution function f(z, p, t): 


Oaft+p:Vef +9V]f = Q(f), f(z, p,0) = fr(x, p), (x,p) € R”, P 
4.1 

Here, (x, p) denotes the position-momentum variables of the phase space, t > 0 
is the time, d > 1 the dimension, Q(f) a collision operator, and 6[V]f is the 
pseudo-differential operator defined by (3.2). Notice that in the semi-classical 
limit € — 0, the term 6[V]f converges to V;V-V,f and thus, (4.1) reduces to 
the semi-classical Vlasov equation [MRS90]. The electric potential V = V (x, t) 
is selfconsistently coupled to the Wigner function f via Poisson’s equation 


AZ AV = Í, fdp- C, (4.2) 


where Az is the scaled Debye length and C = C(x) the doping concentration 
characterizing the semiconductor device [Jün01]. 

In classical fluiddynamics, macroscopic models can be derived from the 
Boltzmann equation by using a moment method. The idea is to multiply the 
kinetic equation by some monomials «;(p) and to integrate the equation over 
the momentum space. This yields the so-called moment equations. Usually, 
not all integrals can be expressed in terms of the moments (which is called 
the closure problem) and an additional procedure is necessary in order to 
close the equations. Depending on the number of moments which are taken, 
a variety of fluiddynamical models can be derived [BD96, Lev96]. The aim of 
this section is to mimic this procedure in the quantum case. Figure 4.1 shows 
the resulting models arising from special choices of the set of monomials. We 
will discuss these models in detail in the following subsections. For this, we 
need to specify the collision operator Q(f) in (4.1). First we introduce in the 
following subsection the so-called quantum Maxwellian. 


4.1 Definition of the quantum Maxwellian 


In order to define the quantum Maxwellian, we use the Wigner transform 
W(P) of an integral operator p on L?(R®) as defined in (3.7), (3.8). Its inverse 
W-—!, also called Weyl quantization, is defined as an operator on L?(R®): 


Multi-Scale Modeling of Quantum Semiconductor Devices 345 





Wigner-Boltzmann 


moment method, moment method, 
entropy minimization, fran Enskog 
= == ote ee eee ee, 


4 fr eee 
quantum moment hydrodynamic quantum moment diffusion models 















































models, (Pp) = (K9(P),...;Kjy(P)) K (P) = (Ko(P),...,Kyy(P)), K(P) even 
«(p) = (1,p,|PpI?/2) «(p) = (1,1p|7/2) 
quantum hydrodynamic quantum energy-transport 
equations equations 
k(p) = (1,p) k(p) = (1) 
isothermal quantum hydrodynamic quantum drift-diffusion equations 
equations 
L al Ls za 
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(W-(f)e)(«) = | f (=) ol)er ©" dpdy for all $ € LR"). 
R24 

With these definitions we are able to introduce as in [DR03] the quantum 

exponential and the quantum logarithm formally by Exp f = W (exp W~!(f)) 

and Log f = W (log W~*+(f)), where exp and log are the operator exponential 

and logarithm, respectively. 

Inspired by Levermore’s moment method for the classical case [Lev96], 
Degond and Ringhofer [DR03] have defined the quantum Maxwellian by us- 
ing the entropy minimization principle. Let a quantum mechanical state be 
described by the Wigner function f solving (4.1). Then its relative quantum 
(von Neumann) entropy is given by 


2 
H(A) = | Fep ( (Los Pep) -1+ BE + v(e,.)) dea. 
Whereas the classical entropy is a function on the configuration space, the 
above quantum entropy at given time is a real number, underlining the non- 
local nature of quantum mechanics. 

We define the quantum Maxwellian My for some given function f(z, p, t) 
as the solution of the constrained minimization problem 


H(My) = min fach : | f(a, p, t)«i(p)dp = m;(x,t) for all z, t, i}, (4.3) 
Rd 
where «;(p) are some monomials in p and m,(z,t) are the moments of f, 
mi(x,t) = (f(z,p,t)Ki(p)), 1=0,...,N, (4.4) 
where we have used the notation (g(p)) = f g(p)dp for functions g(p). The 


formal solution of this minimization problem (if it exists) is given by My = 
Exp (A:«— $|p|?—V(a,-)), where s = (Ko,..., KN), and A = (Ao,..., Aw) are 
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some Lagrange multipliers. If xo(p) = 1 and k2(p) = $|p|, setting Ao = =v, 


Ag = Ny — 1 and à; = A; otherwise, we can write 


My = Exp (A - K(p)). (4.5) 


4.2 Quantum moment hydrodynamic models 


In this section we will derive the quantum moment hydrodynamic equations 
from the Wigner-Boltzmann equation (4.1) with dominant elastic scattering 
employing a moment method. For a special choice of the moments, the quan- 
tum hydrodynamic equations are obtained. If a Fokker-Planck approach is 
taken for the inelastic collision operator, viscous corrections to the quantum 
hydrodynamic model are derived. 


General quantum moment hydrodynamics. Introducing the hydrody- 
namic scaling xv’ = ag, t’ = at, where 0 < a < 1 measures the typical energy 
gain or loss during an electron-phonon collision, the Wigner-Boltzmann equa- 
tion for (fa, Va) becomes (omitting the primes) 


aði fa + alp: Vafa +[Valfa) = Q(fa), (2, p,t) E€ R24 x (0,00), (4.6) 


together with an initial condition for fa and the Poisson equation (4.2) for 
Va. The collision operator is assumed to split into two parts: 


Q(fa) = Qo( fa) + 0Q1 (fa), 


where the first (dominant) part models elastic collisions and the second part 
models inelastic scattering processes. The operator Qo is supposed to satisfy 
the following properties: 


(i) If Qo(f) =0 then f= My, (ii) (Qo(f)«(p)) = 0, (4.7) 


where My is the quantum Maxwellian introduced in section 4.1 and K(p) is a 
vector of some monomials «;(p). If «(p) = (1, p, |p|?/2), condition (ii) expresses 
the conservation of mass, momentum, and energy, which is meaningful for 
elastic collisions. An example fulfilling conditions (i) and (ii) is the BGK-type 
operator [BGK54] Qo(f) = (My — f)/7 with the relaxation time 7 > 0. An 
example of an inelastic collision operator Qı will be given in the subsection 
“Viscous quantum hydrodynamic equations” below. 

In the following we proceed similarly as in [DR03]. The moment equations 
are obtained by multiplying (4.6) by «(p)/a, integrating over the momentum 
space, and using condition (ii): 


O:(K(p) fa) + dive (K(P)pfo) + (&(p)O[Val fa) = (K(P)Q1 (fo))- 


The second integral on the left-hand side of the moment equations cannot be 
expressed in terms of the moments; this is called the closure problem. We can 
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solve this problem by letting a — 0. Indeed, the formal limit a — 0 in (4.6) 
gives Qo(f) = 0 where f = lima—o fa. Hence, by condition (i), f = My. Then 
the formal limit a — 0 in the above moment equations yields 


ôm + diva («(p)pMy) + («(p)O[V] My) = (K(p)Qi(My)), (4.8) 


where m = (k(p)My) are the moments (see (4.4)) and V = lima—o Va solves 
(4.2) with f = My. The above equations have to be solved for x € R? and t > 
0, and the initial condition becomes m(-,0) = («(p)M;,). In the classical case, 
Levermore [Lev96] has shown that the moment equations are symmetrizable 
and hyperbolic. In the present situation, this concept of hyperbolicity cannot 
be used since (4.8) is not a partial differential equation but a differential 
equation with non-local operators of the type A+ (Exp (A - «(p))). 

The system (4.8) possesses the following (formal) property: If 1 and $|p|* 
are included in the set of monomials and if the inelastic collision operator 
conserves mass and dissipates energy, i.e. (Qi(f)) = 0 and (3|p|?Qi(f)) < 0 
for all functions f, the total energy 


1 2 AE 2 
E(t) = Lip 2M 2L, V]? |d 
(t) Ja (Gr s) + 5 | |" ) dzdp 


is nonincreasing. To see this, we notice that for all (regular) functions f, 
(OVIA) =0, (PO[V]f)=—(F)VaV, (slp? OV] f) = (pf) VeV. (4.9) 
From the moment equations 
(Mf) + dive(pMy) =0,  O(3\p|?My) + dive(3lp|’pMys) < (PMF): VaV 
and the Poisson equation (4.2) we obtain formally 


E 
Re 


= J omn -VaV —Va(My))de 
= J omn . VaV + Vdivs(pMș})dz = 0, 


proving the monotonicity of the total energy. 
In the following section we will specify the choice of the monomials, which 
enables us to give a more explicit expression of the system (4.8). 


Quantum hydrodynamic equations. In classical fluiddynamics, the Euler 
equations are derived from the Boltzmann equation by using the monomials 
k(p) = (1, p, |p|?/2) in the moment equations. In this subsection, we derive the 
quantum counterpart, the so-called quantum hydrodynamic (QHD) equations 
(see [DRO3, JMMO05)). 
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Let «(p) = (1,p, |p|?/2). The moments n := mo, nu := my, and ne := m2 
are called the particle, current, and energy densities, respectively. We also 
define the velocity u = nu/n and the energy e = ne/n. In this situation, the 
quantum Maxwellian can be written as My = Exp (Xo + Ai: p + A2ļp|?) or, 
equivalently, as 


My(z,t) = Exp (AG, t) — oo), (4.10) 


where A, w, and T are defined in terms of Xo, A1, and A2. In the following we 
will give a more explicit expression for the quantum moment equations (4.8). 

Using (4.9) and observing that the second and third moments can be 
written as 


(@pM;)=P+nu@u, where P=((p-u)@ p—w)M)), 
(Sp|p\?My) = S+(P+nel)u, where S = ($(p—u)|p— u|? Mp), 


the quantum moment equations (4.8) become 


On + div(nu) = (Qi(Mf)), (4.11) 
0; (nu) + div(nu ® u) + div P — nVV = (pQi(M,)), (4.12) 
d,(ne) + div((P + neL)u) + div S —nu- VV = (S|p|?Qi(Mf)), (4.13) 


where u & u denotes the matrix with components ujug, P is the stress tensor, 
S the (quantum) heat flux, and 7 is the identity matrix in R¢*¢. The electric 
potential is given by (4.2) with f = My or, in the above notation, by 


M AV =n —C(a). (4.14) 


The above system, which is solved for x € R and t > 0 with initial conditions 
for n(-,0), nu(-,0), and ne(-,0), is called the quantum hydrodynamic equations. 
The quantum correction only appears in the terms P and S. We can derive an 
explicit expression in the O(e*) approximation. For this, we need to expand 
the quantum Maxwellian My in terms of e?. As the computations are quite 
involved, we only sketch the expansion and refer to [JMM05] for details. 

The quantum exponential can be expanded in terms of e? yielding Exp f = 
ef — (e?/8)ef B + O(e*), where B is a polynomial in the derivatives of f up 
to second order. This allows for an expansion of the moments 


lp — w|? L2 
= E A- —— n i= Jap, 
(mnune)= | Bsp (a- PZL) (1,0, Sol) ap 


of the stress tensor 


2 
P=nTI+ Enf ($4 1)VlogT 8 VlogT — VlogT 8 V logn 





» RR 
— V logn ® Vlog T — (V 8 V) log(nT*) + T } (4.15) 
V logT 





$ Z Taiv(n )z + O(e4) (4.16) 
12 i 
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and of the quantum heat flux 


T {(S+ 1) RV log (=) za (5 +2)div R+ 5 Au} 


PAU T 2 
e (d n ; 4 
P (5 + 1) n{ RV log (=) + div R} + O(e*), (4.17) 


where the matrix R with components Rj; = Ou;/Ox; — Ou;/Ox; is the anti- 
symmetric part of the velocity gradient and R' is the transpose of R. In the 
semi-classical case € = 0 the stress tensor reduces to the classical expression 
P = nTI. The term S is purely quantum and vanishes if € = 0. The energy 
density is the sum of the thermal, kinetic, and quantum energy, 

d 


1 2 1 
ne = grt + 5mlul? — Enf Alogn — Ft(R' R) + {|V iog T|? — AlogT 


— VlogT - Viogn} + O(e*), (4.18) 


where “tr” denotes the trace of a matrix. 

A simplified quantum hydrodynamic model up to order O(e*) can be ob- 
tained under the assumptions that the inelastic collision part vanishes, Q1 = 0, 
that the temperature is slowly varying, V log T = O(e?), and finally, that the 
vorticity is small, R = O(e?): 


On + div(nu) = 0, (4.19) 


2 
A:(nu) + div(nu & u) + V(nT) — E div(n(V 8 V) logn) — nVV = 0, (4.20) 
2, 
O,(ne) + div((P + neL)u) — —div(n Au) — nu: VV =0, (4.21) 


with the stress tensor and energy density, respectively, 


e? d lorea nE 
P=nTI pY @V)logn, ne= ght + gnlul vide logn. (4.22) 
We notice that if we choose «(p) = (1, p), we obtain the isothermal quantum 
hydrodynamic equations (4.19)-(4.20) with constant temperature T = 1. 

The system (4.19)-(4.21) corresponds to Gardner’s QHD model except 
for the dispersive velocity term (€?/8)div(nAu). The differences between our 
QHD equations and Gardner’s model can be understood as follows. In both 
approaches, closure is obtained by assuming that the Wigner function f is 
in equilibrium. However, the notion of “equilibrium” is different. A quantum 
system, which is characterized by its energy operator W~1+(h), with the Weyl 
quantization W~! and the Hamiltonian h(p) = |p|?/2 + V(x), attains its 
minimum of the relative (von Neumann) entropy in the mixed state with 
Wigner function fg = Exp (—h/Tp). This state represents the unconstrained 
quantum equilibrium. The expansion of fg in terms of €? was first given in 
[Wig32], 


350 A. Arnold, A. Jiingel 


falz, p) = exp(—h(2, p)/To)(1 + €? fo(z,p)) + Ol") 


with an appropriate function f2. As a definition of the quantum equilibrium 
with moment constraints, Gardner employed this expansion of fg and modified 
it mimicking the moment-shift of the Gibbs state in the classical situation: 


h(x, p — u(x) 


T ) (1+ e? fo(a,p — u(x))) + O(e*). 


fa(2,p) = n(x) exp ( — 


In contrast to the classical case, fo is not the constrained minimizer for the 
relative von Neumann entropy. On the other hand, the equilibrium state Mẹ 
used here is a genuine minimizer of the relative entropy with respect to the 
given moments. It has been shown in [JMM05] that both approaches coincide 
if the temperature is constant and if only the particle density is prescribed as 
a constraint. 

Equations (4.19)-(4.22) are of hyperbolic-dispersive type, and the presence 
of the nonlinear third-order differential operators in (4.20) and (4.21) makes 
the analysis of the system quite difficult. In particular, it is not clear if the 
electron density stays positive if it is positive initially. Since the total mass 
J ndz and the total energy, 


d 1 A? Er e?d 
E(t) = (Ser nul? + &|yv |? + 24 5 CnlVlogT |? 
@) Í: Sat + 5nluP + SIVVP + Yva]? + nlV logT] 


2 

+ an tr(RTR)) dx, (4.23) 
are conserved quantities of the quantum moment equations (4.11)-(4.18) (if 
Qı = 0) [JMM05], this provides some Sobolev estimates. However, this esti- 
mate seems to be not strong enough to prove the existence of weak solutions to 
the system. Indeed, for a special model, a nonexistence result of weak solutions 
to the QHD equations has been proved [GaJ01]. This result is valid for the 
one-dimensional isothermal stationary equations, solved in a bounded inter- 
val with Dirichlet boundary conditions for the electron density and boundary 
conditions for the electric potential, the electric field, and the quantum Bohm 
potential at the left interval point. Moreover, the term Tn, in (4.20) has 
been replaced by a more general pressure function p(n) satisfying a growth 
condition. 

The nonexistence result is valid for sufficiently large current densities. On 
the other hand, for “small” current densities fulfilling a subsonic condition 
related to classical fluiddynamics, some existence results for the stationary 
and transient equations have been achieved [HLMO05, Jiin98, JL04, JMR02]. 

The QHD equations contain two parameters: the (scaled) Planck constant 
€ and the Debye length Az. In special regimes of the physical parameters, 
these constants may be small compared to one, such that the semi-classical 
limit € — 0 or the quasi-neutral limit A, — 0 may be of interest, leading to 
simpler models. In fact, the QHD equations reduce in the semi-classical limit 
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Fig. 4.2. Influence of the number of discretization points N on the current-voltage 
characteristics for Gardner’s QHD equations (left) and for the new QHD model 
(right). 


to the Euler equations. This limit has been proved in the one-dimensional 
isothermal steady state for sufficiently small current densities in [GyJ00] and 
for arbitrary large current densities (after adding an ultra-diffusive term in 
(4.20)) in [GaJ02]. The quasi-neutral limit in the isentropic QHD model has 
been performed in [LL05], showing that the current density consists, for small 
Debye length, of a divergence-free vector field connected with the incompress- 
ible Euler equations and a highly oscillating gradient vector coming from high 
electric fields. 

The nonisothermal QHD equations have been first solved numerically by 
Gardner using a finite-difference upwind method, considering the third-order 
term as a perturbation of the classical Euler equations [Gar94]. However, 
hyperbolic schemes have the disadvantage that the numerical diffusion may 
influence the numerical solution considerably [JT06]. This can be seen in 
Fig. 4.2 (left). The figure shows the current-voltage characteristics of a one- 
dimensional resonant tunneling diode, computed from the QHD equations 
(4.19)-(4.21) without the dispersive velocity term but including heat conduc- 
tivity and relaxation-time terms of Baccarani-Wordeman type. The tunneling 
diode consists of three regions: the high-doped contact regions and a low- 
doped channel region. In the channel, a double-potential barrier is included 
(see [JMM05] for the physical and numerical details). 

Due to the numerical viscosity introduced by the upwind method, the solu- 
tion of Gardner’s model strongly depends on the mesh size. On the other hand, 
the solution to the new QHD equations (4.19)-(4.21) presented in [JMMO05] is 
much less mesh depending (see Fig. 4.2 right). 

Notice that the main physical effect of a tunneling diode is that there 
exists a region in which the current density is decreasing although the voltage 
is increasing. This effect is called negative differential resistance and it is 
employed, for instance, to devise high-frequency oscillator devices. 
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The effect of the dispersive velocity term is a “smoothing” of the current- 
voltage curve. In order to study the influence of this term, we replace the factor 
e*/8 in (4.21) by 67/8 and choose various values for ô. Clearly, only 6 = € 
corresponds to the physical situation. Figure 4.3 shows that the characteristics 
become “smoother” for larger values of 6. 


Viscous quantum hydrodynamic equations. We model the inelastic col- 
lisions as electron interactions with a heat bath of oscillators in thermal equi- 
librium (which models the semiconductor crystal). Castella et al. [CE*00] 
derived for such a situation the collision operator 


Qi(w) =vA,w+™4A,f + vedive(Vpf) + “divp(Pf). 


The parameters v, v1, V2 > 0 constitute the phase-space diffusion matrix, and 
T > 0 is a friction parameter, the relaxation time. If v = 0 and 12 = 0, this 
gives the Caldeira-Leggett operator [CL83]. This model allows to incorporate 
inelastic scattering in the quantum hydrodynamic equations. Indeed, using 
the definition of the moments, we compute 


nu 


(Qı(Mş)) = vzn, (pQi(MyF)) = VA,(nu) — vV sn — a! 


: 2ne 
(3|p/?Qi(My)) = vA, (ne) + dyn — vodiv, (nu) — sea 
For simplicity, we suppose in the following that vı = v2 = 1/7 = 0. Assuming 
as in the previous section that the temperature gradients and the vorticity 
are of order £?, we obtain the viscous quantum hydrodynamic equations: 
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Fig. 4.3. Influence of the dispersive velocity term (67/8)(nuzx)2 on the current- 
voltage curve. 
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On + div(nu) = vAn, 


2 
(nu) + div(nu ® u) + V(nT) — —div(n(V 8 V)logn) —nVV = vA(nu), 
2 
d,(ne) + div((P + neI)u) — —div(nAu) — nu: VV =vA(ne), 


where P and ne are defined in (4.22), and V is given by (4.14). Notice that v 
is of the same order as £? [GuJ04]. 

Due to the dissipative terms on the right-hand side of the above system, 
the total energy 


d Te ge cle ye? j 
B= f, (Set + 5rlul + SIV? + ZVV] Jax 


is no longer conserved but at least bounded: 


dE F 
— += n(n — C)dx = 0. 
dt Ae R4 ( ) 


However, it is not clear how to prove the existence of weak solutions or the 
positivity of the particle density from this equation. 

A partial existence result, for sufficiently small current densities in the 
isothermal stationary model, is presented in [GuJ04]. The main idea is the 
observation that, in the one-dimensional steady state, we can integrate (4.24) 
yielding nu — vnz = Jo for some integration constant Jo which we call the 
effective current density (since it satisfies (Jo). = 0). A computation now 
shows that 


2 2 
(er) — v(nu)ss = —V?n(n(log i) ee) + (2) + 2v Jo(log n) xx. 
n x nea 

Hence, the coefficient of the quantum term becomes £? /12+v?, and the viscos- 
ity term transforms to 2vJo(log n)ss. The smallness condition on the current 
density is needed in order to control the convective part (Jé/n),. Also in 
[GuJ04], the inviscid limit v — 0 and the semi-classical limit € — 0 have been 
performed. 

The isothermal viscous model has been numerically solved in [JMi06, 
JT06]. The viscosity v has the effect to “smoothen” the current-voltage char- 
acteristics for a tunneling diode, as can be seen from Fig. 4.4 (left). We refer 
to [JMi06] for details of the employed parameters. The curves are computed 
from the isothermal model. Their behavior is unphysical due to the jump from 
a low-current to a high-current state. This effect can be explained by the con- 
stant temperature assumption. Indeed, in Fig. 4.4 (right) a curve computed 
from the nonisothermal equations is presented. The characteristic shows the 
correct physical behavior but the viscosity leads to rather small peak-to-valley 
ratios (ratio of maximal to minimal current density). 
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Fig. 4.4. Current voltage characteristics for a tunneling diode for the viscous QHD 
model. Left: isothermal case for various values of the viscosity; right: non-isothermal 
model. 


4.3 Quantum moment diffusion models 


In this section we derive quantum moment diffusion equations from a BGK- 
type Wigner equation using a Chapman-Enskog method. For special choices of 
the moments, the quantum energy-transport and the quantum drift-diffusion 
equations in the O(e*) approximation are obtained. 


General quantum moment diffusion equations. We consider the Wigner- 
Boltzmann equation (4.1) in the diffusion scaling x’ = az, t = a?t, where 
0 <a <1 is as in the previous section (neglecting the primes): 


a? Ot fa 7 a(p Veto + [Va] fer) T Q( fa), (x, p, t) € R*? x (0, 00). (4.24) 


Our aim is to perform a Chapman-Enskog expansion in the corresponding 
moment equations and to perform the formal limit a — 0. For this, we pro- 
ceed similarly as in [DMR05] using only monomials of even order, for instance 
k(p) = (1, $|p|?,...). Then the quantum Maxwellian My is the formal so- 
lution of the constrained minimization problem (4.3) with given moments 
mo(a,t),..., my(a,t) with respect to the above set of monomials. 

We assume that the collision operator can be written as 


Q(fa) = Qo(fa) + a? Q1( fa), 


with Qo( fa) modeling the elastic scattering and Q1( fa) the inelastic scattering 
processes. In contrast to the previous section, we assume here that the elastic 
collisions are modeled by a BGK-type operator [BGK54], 


1 


T 


Qo(f) = =M; = F), 

where T = T(x, t) > 0 is the relaxation time. Then Qo(f) satisfies the proper- 
ties (4.7). Concerning inelastic scattering, we suppose only that it preserves 
the mass, i.e. (Qi(f)) = 0 for all functions f. 
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Multiplying (4.24) by «(p)/a, integrating over the momentum space, and 
using condition (ii) in (4.7), we obtain the moment equations 


O(K(p) fa) + a~* (dive («(p)pfa) + (K(p)O[V] fa)) = (E) (fa): 


In order to derive the diffusion models, we employ the Chapman-Enskog ex- 
pansion fo = My, + af, which defines f}. The formal limit a — 0 in (4.24) 
gives Qo(f) = 0, where f = lima—o fa and hence f = My, by condition (i) 
in (4.7). Inserting the Chapman-Enkog expansion in the above moment equa- 
tions, observing that the integrals («(p)pMy,) and (K(p)0[Va] Mf.) vanish, 
since «;(p) is even in p, and performing the limit a — 0, we conclude that 


Or(K(p) My) + diva (n(p)pf") + (w(p)O[V] fF") = (K(p)Qi(My)), (4.25) 


where f! = lima—o f}. It remains to determine the limit f!. Since Qo is a 
BGK-type operator, it holds, using (4.24), 


fà = ~=Qol fa) = -r (að: fa +p: Vefa + 0|Va]fa — aQı(fa)), 


which implies in the limit a > 0 that ft = —T(p:- Vs Mf +0[V] Mp). Inserting 
this expression for f! into (4.25) we obtain the general quantum diffusion 
equations 
ôm — div(rdiv(p ® pk(p)Mf} + T(«(p)p0[V]MfF)) + (K(p)O[V] ft) 
= (6(P)Q1(M,)), (4.26) 


where we recall that m = (K(p)My). With the notation (4.5) we see that the 
expression 


ð Or 
a (7 (pipeniricExp (A-&) an) 


ke ° 9 
=: div(B : VA) 


div(rdiv(p ® pr;(p)My)) = 


can be interpreted as a diffusion term, and (4.26) can be formulated in a 
compact form as 
Ad;A — div(B : VA) = g(A), 


where A = (k ® &Mf) and g(A) denotes the lower-order terms in À. A more 
explicit expression can be derived in the cases N = 1 and N = 0 which will 
be discussed in the following subsections. 


Quantum energy-transport equations. Let N = 1 and «(p) = (1, 5|p|). 
For simplicity, we also assume that the relaxation time is constant, 7 = 1. 
Then we can simplify the quantum diffusion equations of the previous sub- 
section. Indeed, employing the formulas (4.9) and 


(Ap|p?6[V] f) = -(P + neL)VV + £nVAV forall functions f, 
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we obtain from (4.26) the evolution equations for the particle density mo = n 
and the energy density mz = ne (see [DMR05]): 


On — div Jo =0, O(ne) — div J2 — Jo- VV = ($|p/?Qi(f)), (4.27) 


2 
Jo =divP—nVV, Jo =divU — (P +nel)VV + —nVAV, (4.28) 


where P = (p®pMf) is the stress tensor, U = ($|p|?p®pMp) is a fourth-order 
moment, and V is given by (4.14). The variables Jo and Jz are the particle and 
energy current densities, respectively. Noticing that the quantum Maxwellian 
can be written here as 


M _ |p|? 
f(z, t) = Exp | Alz, t) - SFr }, 
one can show that the quantum fluid entropy 
n(t) = J M;(Log My — 1)dxdp = | n(A — ne/T + 1)dx 
Ri Re 


is nonincreasing [DMR05]. 

More explicit equations are obtained in the O(e*) approximation. For this, 
we need to expand the terms P, U and the energy ne = f $|p|?7Exp(A — 
|p|? /2T)dp in terms of €?. If VlogT = O(c) and up to order O(e*), some 
tedious computations lead to the expressions 


d 


2 2 
P=nTI- —n(V @V)logn, ne = ZnT- =n logn, (4.29) 


1 2 
U = (d+ 2)nT7E — Z rT(Alog nI + (d+4)(V @V)logn), 


Equations (4.27)-(4.28) with the above constitutive relations for P, U, and 
ne are called the quantum energy-transport equations. Notice that the expres- 
sions of P and U differ from those presented in [DMR05]. We expect that 
this O(e*) model possesses an entropic formulation similar to the classical 
energy-transport equations [DGJ97]| but unfortunately, no entropic structure 
is currently known. 


The quantum drift-diffusion equations. In this subsection we set N = 0 
and choose ko(p) = 1. Then the quantum Maxwellian reads as M;(z,t) = 
Exp (A(z, t) — |p|?/2), and similar as in the previous subsection, we obtain 


ðn — divJ =0, J=divP—nVV, 


where n and P are defined by 


= |p|? = |p|? 
n= Exp | A— — | dp, P= p®pExp | A— — ] dp. 
Re 2 Re 2 
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Again, the electric potential V is given selfconsistently by (4.14). Some analyt- 
ical properties and numerical results for this nonlocal equation can be found 
in [GaM06]. In the O(e*) approximation, we can simplify the above model. In- 
deed, for T = 1, we obtain from (4.29) divP = Vn—-e?nV(AV/n/6\/n)+O(e*), 
and up to order O(e*) the quantum drift-diffusion equations 


ðn + Saiv(nv(2)) — div(Vn — nVV) =0. 


Jn 

This fourth-order equation is of parabolic type which simplifies the analysis 
considerably, in particular compared to the third-order dispersive quantum 
hydrodynamic equations (4.19)-(4.21). Notice that the quantum term can be 
written as 

div (nv ( a) = div div(n(V @ V) logn), 
where V & V denotes the Hessian. 

The quantum drift-diffusion equations can be also derived in the relaxation- 
time limit from the isothermal QHD model including relaxation terms. This 
limit has been made rigorous in [JLM06], for solutions close to the equilibrium 
state. 

The main mathematical difficulty is to prove the nonnegativity of the solu- 
tions. Since the equation is of fourth order, maximum principle arguments can- 
not be applied. The main idea of the existence analysis in the one-dimensional 
situation is the observation that the functional no(t) = f(n —logn)dz is non- 
increasing |[JP00]. More precisely, if the equations are considered on a bounded 
interval such that n = 1 and nz = 0 on the boundary, 


1 
AERA += [(logn)2de + f (10gm) dx + xz fe — C) logndz = 0. 
I 


By Poincaré’s inequality, this provides an H? bound (if C € L®(I)) and 
hence an L° bound for w = logn, showing that n = e” is nonnegative 
(we loose positivity due to an approximation procedure). Applying a fixed- 
point argument, the existence of weak solutions has been proved in [JP00, 
JV05]. The one-dimensional equations are by now well understood and the 
regularity, long-time behavior, and numerical approximation of nonnegative 
weak solutions have been studied [DGJ05, JM06b, JP01, JV05). 
Unfortunately, the above idea does not apply in the multi-dimensional case 
since the functional f(n — log n)dx seems not to be nonincreasing anymore. 
The new idea is to show that the entropy m (t) = f n(logn—1)dzx is bounded, 


d e? 1 

2n +5 n(Y 8 V)logn|?de +4 | Vvaltde+ xe | (n—C)ndz = 0, 
“dt Rd Rd A$ Jira 
where V & V denotes the Hessian. Since the entropy production integral can 
be estimated as 
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Fig. 4.5. Current voltage characteristics for a tunneling diode from the quantum 


drift-diffusion model for temperature T = 300K (left) and T = 77K (right). 


J (V 8 V)Vn|?dx < ef n|(V @ V) log n|?dzx, 
R4 Ra 


for some constant c > 0 which depends on the space dimension d, this provides 
estimates for y/n in H? and shows that n = (yn)? > 0. These estimates allow 
for a fixed-point argument (see [GST06, JMO6b] for a proof in the case of 
vanishing second-order diffusion and vanishing electric fields). 

Concerning the stationary equations, an existence analysis, even in several 
space dimensions, can be found in [BU98]. When neglecting the second-order 
diffusion (zero temperature case) and the electric field, we obtain the so- 
called Derrida-Lebowitz-Speer-Spohn equation [DL*91], for which additional 
nonincreasing functionals have been found [JM06a]. 

The current-voltage characteristics for a tunneling diode, computed from 
the one-dimensional quantum drift-diffusion equations, are shown in Fig. 4.5 
(left) with the lattice temperature T = 300K. We see that the model is not 
capable to reproduce negative differential differential effects at room tempera- 
ture. However, when using a smaller lattice temperature, negative differential 
resistance can be observed (Fig. 4.5 right). 

The quantum drift-diffusion model produces good numerical results when 
coupled to the Schrodinger-Poisson system employed in the channel region 
[EJ05]. This can be seen from Fig. 4.6 in which the coupled quantum 
drift-diffusion Schrddinger-Poisson model is compared with the Schrödinger- 
Poisson system and the coupled drift-diffusion Schrodinger-Poisson equations 
(see [EJ05] for details). 
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1 Introduction 


Nanostructures are one of the basic features of quantum electronic semicon- 
ductor devices. In order to cover quantum effects in semiconductor device 
simulation, one has to compute the states of the acting electrons and one has 
to incorporate adequate information about these electronic states into the 
device simulation tools, which usually operate on a semi-classical level. 

In semiconductor devices one basically distinguishes three spatial scales: 
the atomistic scale of the bulk semiconductor materials (sub-A), the scale 
of the interaction zone at the interface between two semiconductor materials 
together with the scale of the resulting size quantization (nanometer) and the 
scale of the device itself (micrometer). 

At the ab-initio level, the many-body Schrédinger equation for the elec- 
trons in the potential of the nuclei gives a complete description of the elec- 
tronic structure for a semiconductor bulk material. The electrons present in 
semiconductor materials can be subdivided into two classes: the core electrons 
and the valence electrons. On the energy scale, the core electrons have much 
lower levels than the valence electrons. This allows to decouple them from the 
valence electrons. Together with the nuclei, they form the ionic cores. The 
valence electrons are responsible for the chemical bonds and for many elec- 
tronical and optical properties. The electronic states of the valence electrons 
can be described with high accuracy in the framework of density functional 
theory. This approach results in a one-particle Schrodinger equation on the 
atomistic scale with a potential given by the ionic cores and the mean field 
contribution of the interaction between the valence electrons. 

The prototype semiconductor nanostructure is a quantum well structure 
which consists of a stack of different semiconductor materials grown on a 
substrate, see Fig. 1.2. The thickness of these layers usually ranges from 2 
to 20 nm, defining the nanoscale. The characteristic feature of the material 
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Fig. 1.1. Schema of a SMQW laser diode by HHI, Berlin. Holes injected from the 
p-contact and electrons injected from the n-contact recombine in the optical active 
region. The optical active region consisting of six quantum wells is enlarged on the 
right. Ee and E, are the band edge profiles for the electrons and holes. The quantum 
confinement of the electrons and holes within the wells is indicated. 


interface between two layers is the abrupt change of the parameters of the 
crystal (chemical composition, band structure) on a distance of the same order 
of magnitude as the lattice constant. Basically, this leads to a coupling of 
the electronic states of the different materials meeting at the interface. The 
variation of material properties across nanoscale heterostructures induces size- 
quantization. 

The nanostructure is in the focus of the envelope function approximation, 
which can be understood as a homogenization method with Bloch waves. It 
leads to k-p multi-band Schrödinger-type equations with position-dependent 
effective mass tensor and band-edge profile, which are state of the art for the 
description of electronic states in semiconductor nanostructures. 

Figure 1.1 shows the essential components of an edge-emitting strained 
multiple quantum well (SMQW) semiconductor laser diode, the optical active 
zone of which is a stack of quantum wells separated by barriers from a different 
material. The transversal simulation of such a laser deals with a cross section 
measuring up to several microns in diameter. Semi-classical models working 
on the device scale such as drift-diffusion equations are state of the art for 
the simulation of the electronic behavior of many microelectronic devices. For 
opto-electronic devices based on nanostructures they can be applied success- 
fully, supposed the constitutive laws in these semi-classical models take into 
account quantum effects from smaller scale models for the embedded nanos- 
tructure [BGK00, BHK03, BK*03, BGHO05). 

The present paper focuses on the two scale transitions inherent in the hi- 
erarchy of scales in the device. In section 2, we start with the description of 
the band structure of the bulk material by k-p Hamiltonians on the atom- 
istic scale. Sect. 3 describes how the envelope function approximation allows 
to construct kp Schrödinger operators describing the electronic states at the 
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Fig. 1.2. Quantum well: one-dimensional semiconductor nanostructure consisting 
of two semiconductor materials A and B and two planar material interfaces. The 
unit cell for a binary semiconductor with zinc-blende crystallographic structure like 
gallium arsenide is displayed. 


nanoscale which are closely related to the k-p Hamiltonians from Sect. 2. Spe- 
cial emphasis is placed on the possible existence of spurious modes in the k-p 
Schrodinger model on the nanoscale which are inherited from anomalous band 
bending on the atomistic scale. Sect. 4 is devoted to the mathematical analy- 
sis of these multi-band k-p Schrédinger operators. Besides of the confirmation 
of the main facts about the band structure usually taken for granted, key 
results are conditions on the coefficients of the k-p Schrodinger operator for 
the nanostructure, which exclude spurious modes and an estimate of the size 
of the band gap. Sect. 5 gives an overview of properties of the electronic band 
structure of strained quantum wells. Further, the assumption of flat-band con- 
ditions across the nanostructure allows for upscaling of quantum calculations 
to state equations for semi-classical models. In Sect. 6 we demonstrate this 
approach for parameters such as the quantum corrected band-edges, the effec- 
tive density of states, the optical response, and the optical peak gain. Sect. 7 
is devoted to the application of the k-p Schrödinger theory to low gap quan- 
tum wells, a case where a proper rescaling of the optical matrix element is 
necessary to avoid spurious modes. In Sect. 8 we discuss the application of 
the k-p Schrödinger models to biased quantum wells, the operation mode of 
electro-optic modulators. 


2 Near-band-edge states in semiconductor bulk crystals 


The key property of bulk semiconductor materials is, that the atoms form a 
periodic Bravis lattice defined by its crystallographic unit cell, see Fig. 1.2 and 
[Car96]. The electronic states of the valence electrons in a semiconductor are 
essentially given by the solution of the eigenvalue problem for a Schrödinger 
operator 
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h2 
H = —-——A+ Vesp (r). (2.1) 
0 


2m 


mo is the electron rest mass and Very is the effective potential, consisting 
of the potential of ionic cores (nuclei and core electrons) and the mean field 
interaction between the valence electrons. This potential can be given by em- 
pirical pseudopotential method (EPM) [CB66, CC76, Car96]. Alternatively, 
density functional theory allows to obtain the electronic states by solving an 
effective one-particle Schrödinger equation with a periodic potential. 

Due to the translation invariance of the lattice periodic potential, the 
eigenfunctions of the Schrödinger operator are Bloch waves 


W(r; k) = e!*u(r; k), (2.2) 


defined by a lattice periodic Bloch function u(r; k) depending on the real space 
vector r = (x,y,z) and parametrically on the wave vector k = (kz, ky, kz), 
Bloch theorem see [Blo32, Car96]. The Bloch waves as well as the corre- 
sponding eigenvalue curves E(k) are periodic in the wave vector k. Thus, it 
is sufficient (reduced zone scheme) to restrict the considerations to the first 
Brillouin zone, the unit cell of the periodic lattice in the k space, see [Car96]. 


2.1 k-p equation for Bloch waves 


The Bloch wave ansatz (2.2) leads to an eigenvalue problem for the Bloch 
function u(r; k) [Kan66, Kan82]. Including the spin degree of freedom by using 
a two-component Bloch function one arrives at the at the k-p equation. Using 
the notation of [Bah90] it reads as: 








Hun(r; k) = En(k)un(r; k) (2.3) 
with 
H = Ho t Hry } Hk } Ho Hkso 
h? h hk? 
Ho = -——A+V. » Ay = —k-p, He= f 
JSA N a Me AA ioe 
h2 

Hoo = gpa (Wess) x p): a, Hrso = grga (Vers) x k) +ø. 

p denotes the quantum mechanical momentum operator defined by p = —iħV. 


o = (Ox, Cy, 0z) is the vector of the Pauli spin matrices 


(01 (Osi (10 
s= hio Zuo Tlo: 


Ho and Hķkso describe the spin-orbit interaction. 

The eigenvalue curves E,,(k) are the energy bands of the valence electrons 
in a semiconductor material. Together they form the electronic band struc- 
ture. The essential property of the band structure in a semiconductor is the 
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Fig. 2.1. Schematic view of the band structure in a direct semiconductor material 
(the lowest conduction band and the three topmost valence bands). Due to the 
Kramers degeneracy (spin degeneracy) each state is doubly degenerate. For indirect 
semiconductor materials the conduction band minimum is located outside the zone- 
center k = 0 as indicated by the dashed line. Due to crystal symmetry there may 
exist multiple equivalent band minima in this case, for instance, six in silicon. 


existence of a fundamental spectral gap. Energy bands below and above this 
gap are valence bands and conduction bands, respectively. For the cases of in- 
terest, the maximum F, of the valence band is located at the I point, center 
k = 0 of the Brillouin zone. In direct semiconductor materials such as gallium 
arsenide, the minimum Ee of the conduction bands is also located at the I’ 
point, whereas for indirect semiconductors such as silicon, it is located outside 
the zone-center. The band-edges E. and E, define the band gap Eg = E.— Ey. 
Fig. 2.1 shows a schematic band diagram. In thermodynamic equilibrium, the 
carriers occupy the states near the band extrema. In particular, the conduc- 
tion band states are occupied by the free roaming electrons and the valence 
band states are occupied by the positively charged holes. Therefore, for many 
applications it is sufficient to confine the description of band structure to the 
near-band-edge states. 


2.2 Eight-band k-p Hamiltonian for near band-edge states 


Basically, any Bloch function u(r, k) 
center solutions už (r) = un(r,k = 0): 


k) = So Cur(byul (r) 


The first d near-band-edge states uf labeled n = 1,...,d form the set of 
class-A bands. All remaining states, labeled class-B, are assumed to have only 
a small influence on the near band-edge states. Typical number of class-A 
bands are 1, 4, 6 or 8 [Kan82, Bas88, Chu95, Bah90, MGO94]. The class-B 


can be represented in terms of the zone- 
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bands are far away from the band-edges FE, and E, and thus have only a small 
contribution to a state un(r, k) of class-A near zone-center. This gives rise to 
the representation 


d 
Un(r, k) & 5 en (kjui (r), forn=1.---,d, (2.4) 
n'=1 


Taking into account remote band effects arising from the influence of the class- 
B states by means of Löwdins perturbation scheme one arrives at a nonlinear 
eigenvalue problem for the class-A bands, see [Kan66, Kan82]. A suitable 
linearization of this problem provides the corresponding Hamiltonian. 

In the following we regard eight class-A bands consisting of the lowest con- 
duction band and the three topmost valence bands, all doubly degenerate, see 
Fig. 2.1. For materials with diamond or zinc-blende crystallographic structure 
the space of class-A bands is spanned by S 1, X 7,Y 7,717,595 1,X 1, Y J, 
Z |, where fî and | indicate spin up and spin down, respectively. Following 
the notation of [EW96], the Hamiltonian reads as: 








_ (K(k) +i-G,+E r 
Hsxs(k) = ( T K(k) -i-G xp) (25) 
where the k-p matrix is given by 
A-k? iPokx iPoky iPokz 
KHE —iPok, (L — M)k? + Mk? Nkzky Nkzkz 
~ | -iPoky Nkzky Lk? + M (k? + k?) Nkykz 
—iPokz Nkykz Nkykz Lk? + M(k2 + k2) 
00 00 0000 
Aso | 00—10 _ Aso [0001 
oes 3 101 0 0 ai 3 |0 0 0-i 
00 0 0 0-11 0 
E, 0 0 0 
ges 0 —Ago/3 0 0 
~ 10 0 -Ago/3 0 
0 0 0 -Ago/3 


As their influence is usually neglected, the matrix elements describing the 
influence of bulk inversion asymmetry of potential Ver¢ and the influence of 
k dependent spin-orbit interaction have been omitted, see [Kan82, Bah90]. 

The parameters of the Hamiltonian are the (parabolic) conduction band 
mass Me, the Luttinger parameters yt, 73,74 describing the the heavy hole 
masses in different crystallographic directions, the band gap energy Eg, the 
spin-orbit split-off energy Aso, and the momentum matrix element P). These 
parameters define the coefficients of the Hamiltonian by 
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h? P? h? h? P? 
L=- 4 = M = —— (9? — 254), N = -6—7 + 
2mMo T TE z) 7 Ez’ 2mMo (1 12’); Comin 7 Eg 

(2.7) 


R2 mo E, + 2/34 
Ai a p 2, 
2Mo Me ° E (Eg + Aso) ae 


Po is defined by the Bloch functions S and X 





Pye as | Cee 2 X(r)dr, (2.9) 
unit cell 1 Ox 


mo 
and is a measure for the coupling between the conduction bands and the 
valence bands. It is also known as the optical matrix element, which plays a 
role in the calculation of the strength of optical transitions. Usually it is given 


by an energy parameter 
2mo 


he 
It is possible to obtain the parameters Ey,Aso, Me, yb yk, y and Ep of the 
Hamiltonian from experimentally determined properties of the bulk material. 
There exists a compilation [VM*01] of these band parameters for the 12 major 
HI-V binary semiconductor materials and their ternary and quaternary alloys. 
In the literature, also simplified k-p Hamiltonians for the valence bands 
are used. They include the four-band Luttinger-Kohn Hamiltonian for heavy 
holes and light holes and six-band valence band Hamiltonians [Bas88, Chu95, 
Car96]. For semiconductors with wurtzite crystallographic structure such as 
gallium nitride k-p Hamiltonians have been established as well [CC96]. 


Ey = =P? (2.10) 


2.3 Anomalous band bending 


The reciprocal masses A and L on the diagonal of (2.5) depend on the ratio 
Ç = E,/E,. For ¢ > Cerit one of them changes the sign. For large wave 
vectors k parallel to one of the axis in the k space, these diagonal k? terms 
dominate the behavior of the energy bands. Thus, for ¢ > Cerit the energy 
dispersion becomes anomalous in the sense that e.g. for A < 0 the bending 
of the conduction bands becomes negative, see Fig. 3.2. It is known, that this 
behavior of the band structure can lead to the formation of spurious modes, 
if k-p method is applied to heterostructures, see Sect. 3.4 and [For97, S0103, 
MGO94). 


2.4 Modification of the band structure by mechanical strain 


If a semiconductor material is grown on a substrate with a different lattice 
constant, e.g. indium gallium arsenide on gallium arsenide, we observe me- 
chanical strain induced by the lattice mismatch between the two crystals. 
Basically, this mechanical strain leads to a shift of band-edges Ee, Ey result- 
ing in an altered band gap E, and to a splitting of the heavy hole and light 
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hole bands at the I’ point. Additionally, it has an impact on the anisotropy 
of the effective mass tensor and the warping of the band structure [CC92]. 

The influence of strain £ can be included in the k-p Hamiltonian by adding 
the Pikus-Bir deformation interaction matrix D(e) defined by deformation 
potentials [BP74, Bah90, Chu95, EW96]. 


2.5 Mixed crystal systems 


So far, our considerations apply to elementary semiconductors like silicon or 
binary semiconductors such as gallium arsenide or indium phosphite. Mixed 
crystal systems are alloys of binary semiconductors. They are of particular 
interest, because they allow for band gap engineering in the case of ternary 
compounds like Inj_,Ga,As and additionally for design of the mechanical 
strain in the case of quaternary materials like Inj_,GazAsyP1_,. Though 
strictly speaking, the basic assumptions of a periodic crystal lattice do not 
hold for these materials, they can be treated by similar methods assuming the 
so-called virtual crystal approximation. 

Within the kp framework the band structure of mixed crystal systems can 
be described by the same Hamiltonian used for the elementary and binary 
semiconductors. Their parameters can be obtained by interpolation of those 
of the binary constituents. Often linear interpolation according to the mole 
fraction x yields a sufficient approximation. For some parameters such as the 
band gap energy E, this interpolation has to include a bowing parameter. All 
the related data can be found in [VM*01]. 


3 Envelope function approximation for layered 
semiconductor nanostructures 


We have discussed the modeling of the band structure of bulk materials by 
k-p Hamiltonians. Now, this approach is extended to layered semiconductor 
nanostructures like quantum wells, multiple quantum wells (see Fig. 1.2) and 
double barrier structures [Sin93], [Bas88], [Chu95]. For these heterostructures, 
the translation invariance is broken and the microscopic potential Ves can- 
not be periodic in all space directions. Nevertheless, the heterostructure is 
a crystalline solid and the atoms form an approximate Bravis lattice. The 
microscopic potential now consists of a oscillatory part, which corresponds 
to the potential of the ionic cores, and a globally slowly varying part, which 
corresponds to the composition of the heterostructure from different bulk ma- 
terials, see Fig. 3.1. 


3.1 Envelope function approximation of the wavefunctions 


This obvious two-scale nature of the microscopic potential and of the corre- 
sponding microscopic wave function encourages to treat this problem as a mul- 
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Fig. 3.1. Schematic view of the microscopic potential V (z), the atomistic wave 
function W, the Bloch waves un(r) and the envelope function F(z;k) in a one- 
dimensional heterostructure consisting of atoms of type A and B. The lattice con- 
stant is ao (typically 0.5 nm). un(r) is highly oscillatory on the sub-Å-scale. 


tiscale problem. After first heuristic approaches to this problem, see G. Bas- 
tard [Bas88], the first rigourous approach was made by Burt [Bur92, Bur94], 
who showed, that it is possible to derive a theory for the slowly-variing part 
of the wave functions, the envelope functions, in terms of the atomistic scale. 
This approach leads to a k-p Hamiltonian for the envelope functions. 

For layered nanostructures, the crystal remains periodic in the in-plane 
directions rj = (x,y), perpendicular to the growth direction z. This choice 
of the coordinate system corresponds to epitaxially grown nanostructures on 
[001] oriented substrates. As in bulk k-p theory, we approximate the electronic 
state in the nanostructure in the subspace spanned by d lattice periodic, zone- 
center Bloch functions uł (r), n = 1,...,d of class-A. However, we replace the 
coefficients cn in (2.4) by the envelope functions Fn (z; kj) depending on the 
reduced wave vector ky = (kx, ky): 


d 
Wr; ky) = exp (iky - ry dF (z; kyu P(r). 


3.2 k-p-Schr6édinger operators for layered nanostructures 


According to [Bur92],{Bur94],[Bur99] the vector of the envelope functions F = 
(Fi, ++- , Fa) for a given reduced wave vector ky is the solution of an eigenvalue 
problem 
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(3.1) is a family iof spatially one-dimensional eigenvalue problems indexed 
by ky. It provides a description of the in-plane band structure for layered 
nanostructures, for examples see Sect. 5. 

Far away from the interface between two materials, the Schrödinger op- 
erator H is formally identical with the bulk Hamiltonian (replacing k; —> 
—i0/0z) shifted by the band-edge E, [Bur92, Bur94]. Therefore, the band- 
offset AF, between two materials enters as an additional parameter, available 
for the many material interfaces [VdW89, Kri91, VM*01]. Near the interface 
between two materials, non-local effects are present which lead to a coupling 
of the states across the interface [Bur92],[Bur94]. The approximation of these 
non-local interactions by an interface condition for the envelope functions is 
sufficient for many applications. 

However, one has to be aware that the Burt-Foreman approach relies on 
the assumption that the Bloch functions used in the approximation are the 
same for all material layers. As a consequence, the momentum matrix element 
Po has the same value in all materials. However, the experimentally measured 
value of Po given by the optical matrix element Æp (2.10) is varying. Therefore, 
it is necessary to incorporate this effect in the k-p Schrödinger operator by 
additional interface conditions. 

This suggests the following general structure of the Schrödinger operator: 


e In each material layer, the Schrödinger operator is derived from a bulk 
Hamiltonian like (2.5) resulting in a system of d coupled stationary 
Schrödinger equations, see (4.1). 

e At the material interfaces the continuity of the envelope functions and of 
a flux vector (4.10) is assumed. 

e Ifthe optical matrix element Po differs only slightly between the materials, 
one can define an effective value by suitable averaging. Otherwise, by an 
additional interface condition, one has to define the coupling between the 
envelope functions related to the Bloch functions S and envelopes related 
to the Bloch functions X, Y, Z across the interface [Bur99, For97]. 


The conventional k-p Schrodinger operators have been derived from the 
bulk Hamiltonian by replacing k, — —i0/0z, see for instance [Bas88]. This ap- 
proach also has to be supplemented by coupling conditions at the material in- 
terfaces. These interface conditions have been established implicitly using the 
so-called operator ordering. For the second order terms the BenDaniel-Duke 
operator ordering Ak? — —0/0zA(z)0/0z has been used. For the first order 
terms the heuristic symmetrization rule Ak, > —4[A(z)0/0z + 0/02(A(z)-) 
has been applied [Bas88]. The resulting interface conditions differ from those 
derived by Burt-Foreman. For a comparison of the different interface condi- 
tions see [For93, MGO94]. 
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3.3 Confined states in quantum wells 


A quantum well structure consists of a well material layer embedded between 
two barrier material layers such that band-edge profiles E,(z) and E,(z) form 
a potential well for both holes and electrons, see Fig. 1.1. This potential well 
leads to the localization of the carriers in the quantum well region. This effect 
is known as carrier confinement. Another key feature of quantum wells is the 
appearance of discrete energy levels due to the size quantization induced by 
the small width of the quantum well, typically ranging from 2-20 nm. Repeated 
quantum well structures form multiple quantum wells (MQW), see Fig. 1.1. 

The effect of carrier confinement is utilized in strained multiple-quantum 
well (SMQW) laser diodes in order to increase the density of states and the 
material gain in optical active region of the device. The confined carriers in 
quantum wells are described by the bounded states of the corresponding k-p 
Schrödinger operator (3.1) which are characterized by the discrete part of 
the spectrum. The bounded states decay exponentially in the barrier region. 
Therefore it is possible to use a finite-domain approximation by artificially 
cutting out a simulation domain 2 and applying homogenous Dirichlet (hard 
wall) or Neumann (soft wall) boundary conditions. 


3.4 Spurious modes 


It is known [For97, Sol03] that if the ratio ¢ = E,/E,, exceeds a critical value 
Cerit due to anomalous band bending (see Sect. 2.3) spurious modes occur 
as eigenfunctions of the k-p Schrodinger operator. These spurious modes are 
concentrated around a large value of kz, typically outside the first Brillouin 
zone, and thus spatially oscillatory. The spurious modes lead to a pollution of 
the spectrum of the operator or even to band gap solutions [For97, S0103]. 







































































Fig. 3.2. Schematic view of the dependence of the bending of the energy bands on 
the ratio Ç = Ep/Eg. In the case Å > Cerit the bending for large k vectors becomes 
anomalous. In the case of heterostructures this behavior can lead to pollution of the 
spectrum by spurious modes or even to band gap solutions, as indicated. 
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This difficulty can be overcome in several ways: 


e Appropriate rescaling of the conduction-valence band coupling to achieve 
¢ < Cenit: 
Fitting of a set of bulk band parameters which ensure Ç < Cerit- 
Approximation of the eigenfunctions in a function space which guarantees 
kz < kerit, kerit is the value for which the bending changes its sign. 


[Sti01] achieved a conforming set of band parameters by fitting the Hamil- 
tonian to the real bandstructure in a larger part of the Brilluoin zone (~20%) 
and not only at the zone center. These effective parameters meet the con- 
ditions given by Property 4.2 in Sect. 4 which guarantee that there are no 
spurious modes. 

We have obtained a conforming parameter set by rescaling, see Sect. 7. 


4 Mathematical analysis of k-p Schrodinger operators 


We review the spectral properties of Schrédinger type operators occurring 
in k-p theory of layered semiconductor heterostructures. In this section we 
denote the growth direction of the layers by x and reduced wave vector ky by 
k = (kı, k2) € C2. The formal structure of these operators is — for d bands 


p= (Pispa): 


- Elm) +3 (Magen zoe) 


=1 


+O eD (Manga Ze (Hasse) 


a=1,2 


+ Shs Yves pz alg Saas 


a=1,2 


(4.1) 





a 





+Y ojpi tejp, j= Ted 
l=1 


We assume the following general properties of the coefficients mj, Mo j1, 
Maji, Uajt, Uagjt, ¥j1 and ej on the space intervall 2 = (£0, £z) of the 
coordinate of quantization: 


Property 4.1. All coefficients are essentially bounded, namely 
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mj E€ L°(2,R), Peal et eds 

ej E€ L*(2,R), PS hus 4d, 

Ma € L™(2; B(C%)), a € {0,1,2}, 

Ua € L®(Q; B(C4)), a € {1,2}, 

Uap E€ L™(2; B(C%)), a, B € {1,2}, 
v € L™(2; B(C%)), 


where B(C%) is the Banach space of bounded linear operators on C. 


Property 4.2. The set of band indices is a disjoint union {1,...,d} = D,UD_ 
of conduction and valence bands that means 


min vraiminm,(zx) > 0, max vraimaxm,(x) < 0, 
jEDy LEN jEeD_ LEN 

min vraimine;(x) > 0, max vraimaxe;j(x) < 0; 
jED4l rEQ JED; TEN 


D, or D_ may be empty. We introduce the conjugation operator © on C? by 


1 ifje D4, 


O(c1,...,Ca) = (r1 C1,- - -, Ta Ca), Tj = 2 ifjeD 


Property 4.3. For almost all x € 2 and all a, 8 € {1,2} the operators Ua(x), 
Ua g(x), and v(x) are selfadjoint over C?. 


Property 4.4. There is a finite, disjoint partition £o < zı < ... < xz of the 
interval 2 = (zo, £z) such that the functions m; € R, j = 1,...,d, and 
Ma € B(C%), a = 0,1,2, take exactly one value Mj and Mot, respectively, 
on each of the intervals [21, 7141). 


Following [BK*00] we define parts of the k-p Schrödinger operator between 


Wo” as W3? (9; C?) and its dual space W-1”, the space of anti-linear forms 


on Wee: For y,w € Wo” we set 


£ D i 
(ew) =>) [ge gba, (4.2) 


$ (M3 (2) ola), Ewa) dz, a=0,1,2, (4.3) 
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(Ba et) = | Wale) oa), la) ex de a=1,2, (44) 
(Booed) = | Waala) ee) Ve))eede, aB=1,2, (45) 


Ves I (v(a) o(a), (0) ox dr, (4.6) 


d —" 
(Ey, p) = > ej pj(£)y;(x) dz. (4.7) 


Now we define for each reduced wave vector k = (k1, k2) € C? the k-p 
Schrödinger operator 


H; : W7? — W7? (4.8a) 


by the sum 


Hy = H +40 + X kalAa + Ba) + X` hake Bag +V + E. (4.8b) 


a=1,2 a,B=1,2 


The terms in (4.8) relate to (2.5): E represents the basic energies of the (class 
A) bands involved; V contains the spin-orbit interaction and the influence of 
strain; the operators 


Aot So kaBa and H+ YÑ. kasat » ka kg Bag 


a=1,2 a=1,2 =1,2 


describe the first and second order k-p interactions, U They rep- 
resent, e.g. for diamond like crystal structures, interband (interaction within 
conduction bands and valence bands, respectively) and intraband (interac- 
tion between conduction and valence bands) coupling, respectively. Making 
use of the conjugation operator O we can split the Schrödinger operator into 
intraband and interband coupling terms in the following way: 


1 1 
pre Hk + OH;0), Fy inter =e 


zl 5 (Hk — OHO). 


Hpk intra = 


4.1 Spectral properties 


We first state spectral properties of the operator (4.8) on the space W~1?. 


Theorem 4.5. See [BK *00]. We assume Properties 4.1-4.4. For any k € C? 
the operator (4.8) has the same domain as H, namely Wie and all these 
operators are closed and have a compact resolvent. For any one dimensional 
complex analytic submanifold S of C? the operator family {Hk}itres} is a 
holomorphic operator family of type (A), see [Kat84, VII.2]. The spectrum of 
A, only consists of at most countably many eigenvalues with finite multiplicity, 
which do not accumulate at any finite point. 


Electronic States in Semiconductor Nanostructures 379 


The restriction of the operator (4.8) to the space L? $ £?(Q;C4) has the 
following spectral properties. 


Theorem 4.6. See [BK*00]. We assume Properties 4.1-4.4. The spectra of 
Hy| 2 and H; are the same. The resolvent of Hy| z2 is nuclear. For any k € C? 
the geometric spectral multiplicity is at most d. The domain of Hp|r2 is given 


by 


dom(Hylz2) = W220 fe | Chenerat € Wienen), 


o a) is ie 
as li eae as e is li meee M* = M* 
Pu dia, eee) = ea Ji eee) + Mines -iole 
+ XO ka(M 141 - Mz) p(x) =0, 1=0,...,L- n), (4.9) 
a=1,2 


For functions y from the domain of Hp|z2 the vector 


mZ ola) — (Mia) + E kaMi())gla) (4.10) 


a=1,2 
is continuous across the material interfaces. 


Theorem 4.7. See [BK*00]. We assume Properties 4.1-4.4. If the reduced 
wave vector k = (kı, k2) is from R?, then the operator Hy| 12 is selfadjoint, has 
an orthonormal basis of eigenfunctions in L?, and its geometric and algebraic 
eigenspaces coincide. 


We now investigate how the spectral properties of the operators H;,|,2 de- 
pend on the reduced wave vector k. Unfortunately, the domain of the op- 
erators Hp|z2 is not independent of k. Hence, in contrast to Theorem 4.5, 
the concept of a holomorphic operator familiy of type (A) does not apply 
anymore. However we can prove, see [BK*00], that for any one dimensional 
complex analytic submanifold S € C? the family {Hp|z2}kes, is an analytic 
family of operators in the sense of Kato [Kat84, VII.1.2]. This implies that 
a closed curve, separating two parts of the spectrum of Hp for k = ko, also 
separates corresponding parts of the spectrum of H; for k from a suitable 
neighbourhood of ko, see [Kat84, Th. VII.1.7]. 


4.2 Gap estimate 


From the point of view of electronic structure calculation it is of interest for 
which k € R? the spectral gap between the positive and negative parts of the 
band-edge operator FE can be found in the spectrum of Hg, and how one can 
estimate the size of the gap in terms of k and the data of the problem. This 
relates also to the problem of spurious modes in bandstructure calculation. 
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According to Property 4.2, the lower bound 


def i ae 
os a a le;(x)| (4.11) 


g=1,..., 


of the spectral gap in F is positive, and so is the lowest eigenvalue pu of |H]. 
We define the weighted strength of the band couplings by 


2 
aer 9% Mall Ee (2:81) 
min avreman |m,(2x)| 


yas 


(4.12) 


Moreover, we assume the following property of the coupling matrix Mo: 


Property 4.8. Mo(x) is skewadjoint and © Mo(x) is selfadjoint for almost all 
wer 


Property 4.8 implies Mo + O©Mo0O that means that the intra-band part of 
the operator Mo,intra, see (4), vanishes. This property is satisfied by the usual 
eight-band Hamiltonian and its heterostructure equivalent, see (2.5). The ske- 
wadjointness of Mo is a consequence of symmetric operator ordering of the 
first order terms of conduction-valence band coupling. 


Theorem 4.9. See [BK*00]. In addition to Properties 4.1-4.4 and Property 
4.8 we assume that for almost all x € N 


QU 4 (2x) are skewadjoint, a € {1,2}, (4.13) 
OU a ox) are nonnegative operators, a € {1,2}, (4.14) 

and i 
5 pee spec (OU,,(x) + U;,(x)@) p> 0, (4.15) 


where 


U, (x) $£ cos? nUn (x) + sin? nU29(x) + sin 7 cos 7(Uy2(x) + Uai(a)). 


If |k|, and A satisfy the relations 


0 < à < e- |lullz-(0,01),6(C4)): (4.16a) 
nee (4.16b) 
STA 


M 
0< p- ki v2 (u+ Z) + Ikr- lolze te-à (4160) 


for some 6 > 0, then X belongs to the resolvent set of Hp. 
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Thus, we have established a spectral gap (—A,+A) in Hx. In case of v = 0 
(4.16c) means that Hp at zone center k = 0 has the same spectral gap as the 
band-edges Æ. Thus, spurious modes, that means band gap solutions, can be 
excluded in this case. 

By properly choosing 6 in (4.16) one may obtain sharp estimates of the 
gap, see [BK*00]. For the optimal choice of ô = ôopt(|k|) one obtains the 
following range of k 


e — |lv|l L(.2,8(c4)) 


0< |kI? < ST 


(4.17) 


for which a spectral gap in Hy persists. 


4.3 Remarks 


Apart of k-p Schrödinger operators with interband coupling there are also 
operators with a positive or negative definite main part. In the terms of Prop- 
erty 4.2 this means that D_ = Ý or D} = 9, respectively. Important examples 
are the 4-band Luttinger-Kohn-Hamiltonian [Chu95] and 6-band valence band 
Hamiltonians, see [Car96, CKI94, Chu95, For93, MGO94, Sin93]. If the oper- 
ator H from (4.2) is definite, then the operators H; are semibounded and one 
obtains more results about the way the eigenvalues and eigenvectors depend 
on k, see [BK*00]. 

The whole theory considerably simplifies, if one relaxes Property 4.4 such 
that the coefficient functions Ma, a = 0,1,2 are continuous. This allows for 
a regularization of the operators (4.8). One can prove, see [BK*00], that the 
resolvents of a regularizing sequence converge in trace class to the operator 
(4.8) with jumping coefficients Ma. 


5 Electronic states in strained quantum wells 


The band structure in quantum wells is given by the eigenvalue curves E; (ky) 
and the vector of eigenfunctions F;(z; kj) = (Fi,1(z;hy),--., Fi,a(z; ky) of 
the k-p Schrödinger operator (3.1) depending on the reduced wave vector 
ky = (kx, ky). We regard an eight-band kp Hamiltonian of the type (2.5) in 
the formulation given by [EW96]. Thus, the mathematical results of Sect. 4 
apply. The numerical calculations have been performed by means of WIAS- 
QW [BKqw] using a finite volume method. 

In particular, the structure under consideration is a Inj_,Ga,As,Pi_y 
based strained MQW structure. The stack consists of six 1% compressively 
strained 7 nm thick quantum wells (x = 0.239, y = 0.826), which are separated 
by 10 nm thick 0.3% tensile strained barriers (x = 0.291, y = 0.539), see 
[BHK03]. As confirmed by the calculation of the mini-band formation in the 
MQW structure [BHK03], the barrier width is such that the lowest states in 


382 T. Koprucki, H.-Ch. Kaiser, J. Fuhrmann 























= 1100 
oO 
£ 1000 
> 
9 
2 900 
W 
800 
0 
> 
© -50 
E 
& -100 
o 
i m 
ui -150 
-200 
0.0 0.2 0.4 0.6 0.8 1.0 
Ikl [1/nm] 


Fig. 5.1. Band structure and warping in a 7nm InGaAsP compressively strained 
quantum well, calculated with WIAS-QW. Top: conduction bands, bottom: valence 
bands. ky in [100]- (solid) and [110]-direction (dashed). We remark that the bands 
are twice degenerate due to symmetry. Reprinted with permission from [BHKO3]. 
© 2003, IEEE. 


the wells decouple, that means, they are strongly localized in the individual 
quantum wells. This allows to restrict ourselves in the following to single 
quantum well calculations. 


5.1 Band structure 


Fig. 5.1 displays the band structure F;(kj) for the 7 nm InGaAsP quantum 
well under consideration. Due to their different effective mass, in quantum 
wells the confinement energy Econs = Ev,qu — Ei (ky = 0) of the heavy hole 
states is smaller than for light-hole states (Ey, qw is the valence band-edge of 
the well material). Thus, the degeneracy (at k = 0) between light and heavy 
holes in bulk materials, see Fig. 2.1, is lifted. The compressive strain increases 
the splitting between the heavy hole and light hole states while energetically 
favoring the heavy holes. Thus, the highest two valence bands correspond to 
heavy hole states. 

We have investigated two different crystallographic directions to demon- 
strate the angular dependence of the dispersion (warping effect). We observe 
weak warping for the conduction band, and strong warping and strong non- 
parbolicities for the valence bands, see Fig. 5.1. 


5.2 Momentum matrix elements for interband transitions 


Important information on the electronic states within the quantum well is 
encoded in the intersubband momentum matrix elements. For the transition 
from the conduction band state F; to the valence band state F; these are 
defined, see [EBWS95], by 
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Fig. 5.2. Momentum matrix element dispersion |epi;(ky)|? (5.1) for transitions be- 
tween the lowest conduction band CB, shown in Fig. 5.1 top, and the upper valence 
bands shown in Fig 5.1 bottom. Different polarization directions e are shown. Top: 
TE-polarization (p||ex), bottom: TM-polarization (p||ey). Note the normalization to 
the same quantity Po in both pictures. Reprinted with permission from [BHKO3]. 
© 2003, IEEE. 


Fy dz. (5.1) 





Mu i 
Pij = r 5 i Fip (Vr Hu (hj kz)) eae 
LV 

Fig. 5.2 displays the most prominent interband momentum matrix elements 
for TE and TM polarization. These momentum matrix elements have a dis- 
tinctive dispersion and warping. In particular, the dominant transition for 
TE polarization is between the lowest conduction band CB, and the topmost 
valence band HH. 


6 Upscaling to semi-classical state equations 


In the simulation of opto-electronic devices such as strained multiple quan- 
tum well (SMQW) laser diodes, semi-classical models turned out to be very 
successful, provided that information about the optical active zone is derived 
from smaller scale models. This requires suitable upscaling schemes for semi- 
classical constitutive laws. 

The simulation of opto-electronic devices requires at least the description 
of the flow of electrons and holes, the description of the optical field, and 
the coupling of these models by the radiative recombination of electrons and 
holes. In the following, we focus on a semi-classical carrier flow model, and 
on a specific part of radiative recombination; for models of the optical field, 
see [BGK00, BGH05, BK*03, BHK03]. In particular, we deal with upscaling 
schemes for quantities such as the density of states, the optical response, and 
the optical peak gain, from electronic structure calculations as described in 
Sect. 5. 
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6.1 Drift-diffusion equations 


The most popular semi-classical models for the carrier flow in a semiconductor 
device are drift-diffusion models. The basic model of this type is the van 
Roosbroeck system, see [Gaj93] and the references therein, which describes the 
flow of electrons and holes in a selfconsistent field due to drift and diffusion. It 
comprises current-continuity equations for the densities n and p of electrons 
and holes, respectively, and a Poisson equation for the electrostatic potential 


gp: 


On Op : 
—— — ‘In = j —— . = — .1 
IF, Ved qR a tV Ip qR (6.1) 
egV(esVy) = —q(C+p—n). (6.2) 


£o is the vacuum dielectric constant, €s is the static dielectric constant, q 
is the elementary charge, and C is the net doping. The recombination rate 
R in (6.1) involves all non-radiative and radiative recombination processes, 
and depends at least on n and p. The currents jn and jp are driven by the 
negative gradients of the quasi-Fermi potentials F, and Fp for electrons and 
holes, respectively: 


jn = —QbtnV En, jp = —IPbpV Fp; (6.3) 

Un and up are the mobilities of electrons and holes. 
The current continuity equations describing the motion of electrons and 
holes have to be completed by laws for the recombination of electrons and 
holes, and by Fermi-Dirac distributions for the densities of electrons and holes: 


— qk, — Ee E, + qFp — 
n=N-F1/2 (eo) » D=NoFij2 (See . (6.4) 


E: and E, denote the conduction and valence band edges, respectively. Ne and 
N, are the corresponding densities of states (DOS) given by the expressions 


aE 3/2 „kpT 3/2 
n.=2(% B ) l n,=2(% B ) i (6.5) 





2rh2 Qrh2 


Me and m, are the density of state masses, T is the temperature, kg is Boltz- 
mann’s constant, and F1/2 Fermi’s integral of the order 1/2: 


ae enh ae 
Fiala) = | aes (6.6) 


The constitutive equations link the classical drift-diffusion equations to the 
quantum mechanical model for the electronic structure. Typically one derives 
parameters in a constitutive law by upscaling of electronic structure infor- 
mation. Examples are the band-edges Ee, Ey, and the density of states Ne, 
Ny. 
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6.2 Effective band-edges and densities of states 


In the flat-band case, qy — qFn and qy — qFp are approximately constant 
across the nanostructure. Therefore, we can define the quasi Fermi levels of the 
quantum confined electrons and holes by Ern = qy—qFn and Erp = qpy— Fp, 
respectively. SMQW lasers are usually designed such that they operate in this 
flat-band mode. Assuming thermodynamic equilibrium of confined electrons 
and holes, respectively, their local density distributions for given quasi Fermi 
levels Erp and Epp are calculated by 


ngul) = So ga | F(E) - Brn) Cem) Bethy, (67) 


iEc 


Paw (2 = =J Erp — E; (kp) IE; (23 By 2 dky, (6.8) 


with Fermi’s function 


f(E) = (1+ (E (6.9) 


We introduce the average carrier densities per quantum well by 
n = fo Nqu(z)dz/dgw and p = fo Pqu(z)dz/dgw, where dqw is the thickness 
of the well. For our example quantum well structure from Sect. 5 the local 
carrier distributions are plotted in Fig. 6.1 for different values of the sheet 
concentrations N = ñ dqw and P = P dgw- 

For a calculated band structure we regard the averaged carrier densities 
in their dependence on Eryn, Epp and kpT: 
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Fig. 6.1. Local carrier density distributions (6.7) and (6.8) for different values of the 
sheet concentrations N = P = 2-107 em~?, 3-1071? em~?, 4-1071? cm7? for ambient 
temperature. One observes a stronger confinement of the holes in comparison to the 
electrons. 
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Fig. 6.2. Relation (6.10) (kp-calculation) between electron density n and Fermi 
level Ern relative to the net band edge E. for temperatures T=290K, 315K, 340K. 
The dashed lines indicate the fit to the macroscopic state-equation (6.11). Reprinted 


with permission from [BHK03]. © 2003, IEEE. 


n= n (Ern, kBT), p= p (Erp, kBT), (6.10) 


see Fig. 6.2 and Fig. 6.3 for our example. These relations are fitted to the 
Fermi-Dirac distributions 


Erpn — Ee 
n = NeFij2 pe (6.11) 
= E,- E 
P= No Fij2 (A) (6.12) 


by adjusting the parameters N.,E, and Ny, Ey for a specified reference tem- 
perature To. By (6.5) we obtain the density of state masses me and my. This 
procedure provides quantum corrected band-edges Ee, Ey and density of state 
masses m, and my. Thus, one can treat individual quantum wells as an artifi- 
cial classical material, whose parameters significantly differ from bulk values 
of the quantum well materials. This approach has been applied to the the 
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Fig. 6.3. Relation (6.10) (kp-calculation) between hole density p and Fermi level 
Erp relative to the net band edge E, for temperatures T=290K, 315K, 340K. The 
dashed lines indicate the fit to the macroscopic state-equation (6.12). Reprinted with 
permission from [BHK03]. © 2003, IEEE. 
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Fig. 6.4. Material gain spectra (TE-polarization) according to (6.13) for different 
sheet concentrations and temperatures. Reprinted with permission from [BHK03]. 
© 2003, IEEE. 


MQW structure discussed in Sect. 5. Fig. 6.3 and Fig. 6.2 demonstrate the 
high quality of the fit for this particular structure [BHK03]. 


6.3 Material gain 


Still assuming the flat-band conditions, the band structure and the momentum 
matrix elements (5.1) enter the expression for the material gain 


qha? a 
= f(E: - Ern) (1 — f(E; - E 
D a o pn) (1— f(E; — Erp) x 
jEv 
h Sup [i OE 
keT (Ey — Bj) -huo +? 


(6.13) 


where the last factor includes broadening due to collision processes [End97]. 
The latter have been parametrized by a characteristic intra-band relaxation 
time 7 of 60 fs (I = h/r). cis the speed of light and n, the refractive index. 

For the case of an undoped active region and local charge neutrality the 
calculated gain spectra are drawn in Fig. 6.4 for different excitations and tem- 
peratures. The evolution of the corresponding maximum material gain with 
the carrier density shown in Fig. 6.5 for different temperatures. In [BHK03] it 
is discussed how to fit the calculated peak gain characteristics to a logarithmic 
model g(n) = go log(n/nz), which is used as a state equation in semi-classical 
equations. 
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Fig. 6.5. Maximum material gain for different carrier densities and temperatures 
T=290K, 300K, 310K. Dashed lines indicate the fit to the logarithmic gain model 
g(n) = golog(n/nz). Whereas go = 2155cm™~* is not very sensitive to the tempera- 
ture and has been kept constant here, the transparency density nt = n+(T) roughly 


linearly increases with the temperature. Reprinted with permission from [BHKO3]. 
© 2003, IEEE. 
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Fig. 6.6. Spontaneous radiative recombination rate (6.14) for different tem- 
peratures T=290K, 315K, 340K. Dashed: fit to Rraa = Bn®, with a = 


1.47 (290K), 1.51 (3154), 1.55 (340K). Reprinted with permission from [BHK03]. 
© 2003, IEEE. 


6.4 Spontaneous radiative recombination rate 


The spontaneous radiative recombination rate Rraa is calculated according to 
[Wen] by 


2 
Nr 1 1 2 
Rra =— =r a) Ei — E;) Pi; 
d th2c3eqme dyw An2 a ji 4) [pisl x re i 


J(Ei z Ern) f (Erp = E;) dky. 
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It is shown in Fig. 6.6 together with the fit to Rraa = Bn%. The exponent 
was approximately a = 1.5 which differs from the commonly used models 
corresponding to a = 2. 


7 Avoiding spurious modes by adjusting Ep 


For low band gap semiconductors, the parameters of the eight-band Hamil- 
tonian often cause anomalous band bending, see Sect. 2.3. This may lead 
to spurious modes of the k-p Schrödinger operator for heterostructures, see 
Sect. 3.4. However, it is possible to fullfil the condition ¢ < Cerit by lowering 
the value of Ep (2.10): 


E, > E,=aEy, a<l (7.1) 


This adjustment has only a small influence on the band structure for low 
values of k and therefore leaves the confined states nearly untouched. 
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Fig. 7.1. (a) Calculated conduction and valence band structures for a 
InAszSbi-2/GaSb quantum well with the As mole fraction of x = 0.82 at ambi- 
ent temperature, see [KB*05]. (b) Comparison between eight-band k-p-band struc- 
ture calculations and experiments for a set of InAsSb/GaSb multiple quantum well 
samples with different As mole fractions. Full circles indicate the MQW-edge as 
determined from transmittance, whereas open circles point to the edge as deduced 
from PL. Squares mark the edge as obtained from the energy difference of the lowest 
conduction band (at k = 0) and the highest valence band (at k = 0) according to 
8-band k-p-calculations. One observes a systematic IR-shift of the QW-edge data 
experimentally determined compared to the k-p calculation. This tendency is also 
present in the InAso.s95Sbo.105 bulk-like sample, indicated as ’Reference’. Thus it is 
not likely to be a residual effect of the k-p calculation. (b) reprinted with permission 
from [KB*05]. © 2005 American Institute of Physics. 
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In the limit case Ep — 0 the conduction bands and the valence bands 
decouple, and one arrives at the usual six-band Hamiltonian for valence band 
states. This Hamiltonian still provides a reasonable approximation of the va- 
lence band structure [CC92, For93, MGO94, CC96]. 

The InAs,Sb,_, system is the alloy with the lowest direct band-gap value 
of all IIJ-V-semiconductor materials. This mixed crystal system is of utmost 
interest for infrared (IR) optoelectronic applications such as heterostructure- 
based lasers and detectors [WG*00, GKS00]. In [KB*05] the rescaling ap- 
proach is applied to the calculation of the electronic states in a set of samples 
with different mole fractions x. For a comparison of eight-band k-p calculation 
with the experimentally measured properties of the samples, see Fig. 7.1. 


8 Biased quantum wells 


Typical applications of biased quantum well structures are photonic integrated 
chips consisting of integrated semiconductor laser/electro-absorption modu- 
lator [Bas95, Ch. 6], [Chu95, Ch. 13]. In such devices, the external electric 
field allows to modulate the absorption or the reflectivity for specific spectral 
regions. 

The applied voltage leads to tilted band edges in the quantum well region, 
hence, to meta-stable states. If the applied bias is small with respect to the 
band-edge offsets AF, and AF, of the quantum well to the barrier, then hard 
wall or soft wall boundary conditions can still be applied for the calculation of 
the electronic states [SB87, DF93, ASV98]. The calculated eigenvalues E; (ky) 
yield an reasonable approximation of the real parts of the complex eigenvalues 
corresponding to the meta-stable states of the biased structure. 

As an example we discuss the application of this approach to the calcu- 
lation of the band structure and the local carrier density distributions for a 
13 nm thick InGaAsP quantum well, lattice-matched to InP barriers. In Figs. 
8.1 and 8.2 we present the results of eight-band k-p calculations for various 
values of the electrical field. Due to the Kramers degeneracy the heavy and 
light hole bands are double degenerate for an unbiased quantum well, see Fig. 
8.la. This spin degeneracy is lifted if an electric field is applied to the quan- 
tum well, see Fig. 8.1b. The spin splitting of the valence bands induced by 
the external electric field is known as the Rashba effect. 

Fig. 8.2 shows titled band edge profiles and carrier density distributions 
for different applied biases. For the electric field strength F = 6 V/um one 
observes the onset of accumulation of the hole density in the barrier region, 
marking the limit of the approach based on the use of hard wall boundary 
conditions. 
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Fig. 8.1. Band structure ({100] direction) in a 13 nm InGaAsP quantum well lattice- 
matched to InP barriers for two different external electric fields. (a) F=0 V/ym, 
and (b) F=4 V/um. At F=0 V/um, the valence bands are double degenerate. At 
F=4 V/ym, they split due to the applied external electric field (Rashba effect). 
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Fig. 8.2. Tilted band edge profiles E.(z) and E,(z) and local density distributions 
for electrons and holes in a 13 nm InGaAsP quantum well lattice-matched to InP 
barriers for different external electric fields. F=0 V/um, 2 V/um, 4 V/um, 6 V/um. 
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Summary. We consider the modeling, simulation, and optimization of microstruc- 
tural cellular biomorphic ceramics obtained by biotemplating. This is a process 
in biomimetics, a recently emerged discipline in materials science where engineers 
try to mimick or use biological materials for the design of innovative technological 
devices and systems. In particular, we focus on the shape optimization of micro- 
cellular silicon carbide ceramic materials derived from naturally grown wood. The 
mechanical behavior of the final ceramics is largely determined by the geometry of 
its microstructure which can be very precisely tuned during the biotemplating pro- 
cess. Our ultimate goal is to determine these microstructural details in such a way 
that an optimal mechanical performance is achieved with respect to merit criteria 
depending on the specific application. Within the shape optimization problem the 
state variables are the displacements subject to the underlying elasticity equations, 
and the design variables are the geometrical quantities determining the microstruc- 
ture. Since a resolution of the microstructure is numerically cost-prohibitive, we use 
the homogenization approach, assuming periodically distributed microcells. Adap- 
tive mesh-refinement techniques based on reliable and efficient a posteriori error 
estimators are applied in the microstructure to compute the homogenized elastic- 
ity coefficients. The shape optimization problem on the macroscopic homogenized 
model is solved by primal-dual Newton-type interior-point methods. Various numer- 
ical experiments are presented and discussed. 


1 Introduction 


Biomimetics, also called bionics or biomimicry, is a discipline in materials 
science that has recently attracted a lot of attention. It allows the cost- 
effective production of high performance technological devices and systems 
by either mimicking or using naturally grown biological structures (cf., e.g., 
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[Eli00, HDL02, GA88]). In contrast to engineering materials, biological struc- 
tures exhibit a hierarchically built anatomy, developed and optimized in a 
long-term genetic process. Their inherent cellular, open porous morphology 
can be used for liquid or gaseous infiltration and subsequently for high tem- 
perature reaction processes. A specific example of such a naturally grown 
biological material is wood which exhibits an anisotropic, porous morphology 
with excellent strength at low density, high stiffness, elasticity, and damage 
tolerance. Typical feature of the wood structure is the system of the tracheidal 
cells which provide the transportation path for water and minerals within the 
living tree. This open porous system is accessible for infiltration of various 
metals. 

A recent idea in biomimetical applications is to take advantage of naturally 
grown wood in the production of high performance ceramics to be used as fil- 
ters and catalysts in chemical processing, heat insulation structures, thermally 
and mechanically loaded lightweight structures, and medical implants (for in- 
stance, for bone substitution). In particular, silicon carbide (SiC) is known as 
a material that is not only suitable in microelectronical applications due to its 
bandwidth structure but also useful in mechanical and high temperature appli- 
cations with regard to its excellent thermomechanical properties. Since wood 
essentially consists of carbon (C), the idea is to use it as a basic material for 
the production of highly porous ceramics. Among the large variety of ceramic 
composites, new biomorphic cellular silicon carbide ceramics from wood were 
recently produced and investigated, see [GLK98a, GLK98b, OT*95, VSG02]. 
The conversion of naturally grown wood to highly porous SiC ceramics is 
done by a process called biotemplating which includes two processing steps, 
illustrated in Fig. 1.1. 

Biological porous carbonized preforms (also called C—templates) can be 
derived from different wood structures by drying and high-temperature py- 
rolysis at temperatures between 800 and 1800°C and used as templates for 
infiltrations by gaseous or liquid silicon (Si) to form SiC and SiSiC-ceramics, 
respectively. During high-temperature processing, the microstructural prop- 
erties of the bioorganic preforms were retained, so that a one-to-one reproduc- 
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Fig. 1.1. Processing scheme of SiC-ceramics from wood 
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Fig. 1.2. SiC-ceramic derived from pine wood a) radial direction; b) axial direction 


tion of the original wood structure was obtained, see Fig. 1.2. The resulting 
cellular composites show low density, high specific strength, and excellent high 
temperature stability. 

The geometry of the final ceramics, i.e., the widths and lengths of the 
different layers forming the struts, can be determined very precisely by an 
appropriate tuning of the process parameters. This raises the question how to 
choose these microstructural geometrical data in order to achieve an optimal 
performance with respect to a prespecified merit criterion depending on the 
specific application. From a mathematical point of view, this issue represents 
a shape optimization problem where the state variables are subject to the 
underlying elasticity equations and the microstructural data serve as design 
variables. As far as the solution of such a shape optimization problem is con- 
cerned, the resolution of the microstructure is cost-prohibitive with respect 
to both computational time and storage. Therefore, the idea is to perform 
homogenization, assuming a periodically distributed microstructure, and to 
apply the optimization to the homogenized model. 

In this study, we focus both on the homogenization process and on the 
application of state-of-the-art optimization strategies for the numerical solu- 
tion of the shape optimization problem under consideration. The remaining 
of the paper is organized as follows: In Sect. 2, we describe in detail the ho- 
mogenization technique that provides a macromechanical model where the 
components of the homogenized elasticity tensor reflect the microstructural 
details. Section 3 deals with the setting of our shape optimization problem. In 
Sect. 4, we present a primal-dual Newton interior-point method and in Sect. 5 
we comment on the numerical solution of the condensed primal-dual system. 
Section 6 concerns adaptive grid-refinement procedures based on a posteriori 
error estimators. In particular, we use the Zienkiewicz-Zhu error estimator 
proposed in [ZZ87]. Iterative solution techniques for the homogenized elas- 
ticity equation and the microcell problem are discussed in Sect. 7. Various 
numerical results are given in Sect. 8. 
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2 Homogenized computational model 


In this section, we briefly explain the derivation of the homogenized compu- 
tational model on the macroscale by using the asymptotic homogenization 
theory. Homogenization has been successfully used in the last three decades 
for solving multi-scale problems on computational regions occupied by hetero- 
geneous microstructural materials (see, cf., [BP84, BLP78, JKO94, SP80]). 

Let 2 C RI, d = 2,3, be a domain occupied by a heterogeneous ma- 
terial with microstructures of periodically distributed constituents. Suppose 
that the boundary of 2, denoted by I’, consists of a prescribed displacement 
boundary Ip (meas lp > 0) and a prescribed traction boundary Ir, such 
that r = [pUPr, Ip NI = 0. Let b be the body force, ù be the prescribed 
displacement on Ip, and t be the prescribed traction on Ir. 

The homogenized model for our original heterogeneous material occupying 
the domain 2, 2 C R4, d = 2,3, is illustrated in Fig. 2.1. The main idea of 
the homogenization is to replace the heterogeneous material by an equivalent 
homogenized material, extracting the information for the material properties 
of the various microstructural constituents (or different phases). 

The microscopic and macroscopic models are considered simultaneously 
supposing a strong scale separation, i.e., a large gap in length scale between 
the macroscopic component and the microstructure. In practical applications 
the microscopic length scales are orders of magnitude smaller than the physical 
macroscopic length scale. A main assumption in the homogenization approach 
is that the original heterogeneous material workpiece is composed of peri- 
odically distributed microstructures of various constituents. To couple prop- 
erly the micro- and macro-scales, we choose a representative volume element 
(RVE) or a unit microstructure. 

Consider a stationary microstructure with a geometrically simple tra- 
cheidal periodicity cell Y = [0,1]?, d = 2,3, (see Fig. 2.2) consisting of 
an outer layer of carbon (C), interior layer of silicon carbide (SiC), and a 
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| 
=Á —— Homogenized cell 
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Fig. 2.1. The macroscopic homogenized material model 
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Fig. 2.2. a) 3-D unit periodicity cell Y, b) 2-D unit periodicity cell Y = PUSiCUC 


centered pore channel (P, no material). We introduce two space variables x 
(macroscopic/slow variable) and y (microscopic/fast variable) and denote by 
E€, y = x/e, € < 1, the scale parameter (dimensionless number) which, in 
fact, represents the periodicity under the assumption that £ is very small 
with respect to the size of (2, i.e., there exists a large scale gap between the 
microstructure and the macroscopic component. 

The parameter £ allows us to define macrofunctions in terms of the mi- 
crostructural behavior and vice versa. Thus, for any state function f(y) := 
f(a/e), one can compute the spatial derivatives by using the following differ- 
entiation rule 

d ( =) as Of (zx, y) _1 Of (2, y) 
ee a at Mee NEEL, 
dx i Ox Oy 


E 
Consider the following elasticity equation defined in the microstructure Y 


-V-o=F in Y (2.1) 


with a load vector F. Here, ø is the microscopic symmetric stress, u € H'(Y), 
is the corresponding displacement at point y € Y, and e is the microscopic 
symmetric strain with components 


ei;(u(y)) = Gre I =a . (2.2) 





The problem (2.1) is subject to periodic boundary conditions on the outer 
part of OY, Neumann boundary conditions around the pore, and continuity 
conditions [u] = 0 and [ø - n] = 0 on the interfaces between the different 
phases, see Fig. 2.2. The symbol | | denotes the jump of the function across 
the corresponding interface with a normal vector n (cf., e.g., [BP84]). 

Assuming linearly elastic constituents, the unit microstructure is governed 
by the Hooke law ø = E: e with componentwise (i, j, k,l = 1,...,d) consti- 
tutive relations as follows 
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oij(u) = Bijzni(y) exi(u(y)). (2.3) 


Here, we adopt the Einstein convention of a summation on repeated indices. 
The 4-th order elasticity (also called plain-stress) tensor E(y) with compo- 
nents E;jkı(y) characterizes the behavior of the material at point y and de- 
pends on material constants like Young’s modulus and Poisson’s ratio. Note 
that E(y) is zero if y is located in the porous subdomain of the microstructure 
and coincides with the elasticity tensor of the material if y is located in the 
corresponding microstructural constituent. The elasticity tensor is symmetric 
in the following sense 





Eijk = Ejint = Eijik = Ektij (2.4) 





and satisfies the following ellipticity conditions 
Eight XigXkl > C Xij» VXij = Xjis 


for a constant c > 0 (cf., e.g., [BP84, BLP78, JKO94]). 
Denote by ue(a) := u(a/e) the unknown macroscopic displacement vector 
and consider the following family of elasticity problems 


-V-o.=b nA, (2.5) 


subject to a macroscopic body force b and a macroscopic surface traction t 
applied to the portion I'r C ôR. Here, o-(u-) := E-(x)e(u-(x)) is the stress 
tensor for x € 2 and E(x) := E(a/e) = E(y) is the piecewise constant 
elasticity tensor defined in Y. Following, for instance, [BLP78] for the basic 
concepts of the homogenization method, the unknown displacement vector is 
expanded asymptotically as 


u(x) = UO (z, y) teu (x,y) te u%(ay)t..., y=æ/e, (2.6) 


where u (x, y), i > 0, are Y—periodic in y, i.e., take equal values on opposite 
sides of Y. Consider the space H := {u|u € H'(Q), u = 0 on Ip}. Under 
the assumptions of symmetry and ellipticity of the elasticity coefficients, it 
was shown in the homogenization theory that the sequence {us} of solutions 
of (2.5) tends weakly in H as e > 0 to a function wu (æ) € H, the solution 
of the following macroscopic homogenized problem with a constant elasticity 
tensor. 

-V-o =b in Q, (2.7) 


where ø = o(u) := E” e(u®(x)), £ € 2, and E” stands for the homog- 
enized elasticity tensor. Note that u (æ) depends only on the macroscopic 
variable x and is independent of the microscopic scale y. The leading term 
u) in (2.6) is called a macroscopic displacement and the remaining terms 
u, i > 0, are considered as perturbed displacements. 

The homogenization method requires to find periodic functions satisfy- 
ing the following problem in a weak formulation to be solved in the microscopic 
unit cell 
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Oy; 
where œ € H'(Y) is an arbitrary Y—periodic variational function. The func- 
tion E, also referred to as the characteristic displacement, is found by solv- 
ing (2.8) in Y with periodic boundary conditions. After computing ¿*' 


defines the homogenized coefficients by the following formulas (we refer to 
[BP84, BLP78, JKO94] for details) 


1 oek! 
Eph = Fl, (B (y) — Fmt) dY. (2.9) 


Due to the symmetry conditions (2.4), the 4-th order homogenized elas- 
ticity tensor E” = (Ev) can be written as a symmetric and usually a dense 
matrix. In the case d = 2 it is a 3 x 3 matrix and has the form 

Ein Eñ Ein 
E” = Eža Ex» i (2.10) 
SYM Eni 


Ha) i 0 i 
[ Pints — - a= | Bua E dY, (2.8) 
q J 


The 3-d homogenized tensor can be written, respectively, as a 6 x 6 matrix 


H H H H H H 
Eii Eii22 Fiiss Eiiiz Eii23 Fiiis 
H H H H H 
E222 E3233 E212 E3223 E3213 
H H H H 
EE a E3333 E3312 E3323 E3313 2.11 
E E E 
1212 1223 1213 
H H 
E2323 E2313 
H 
SYM EB 


The computation of the homogenized elasticity coefficients can be done 
analytically for some specific geometries as, for instance, layered materials 
or checkerboard structures. In case of more complicated microstructures, the 
computation of Eph has to be done numerically through a suitable micro- 
scopic modeling. 

Once the constant homogenized coefficients from (2.9) are computed, one 
comes up with the homogenized macroscopic equation (2.7) given in a weak 
form as follows 


f PbS an= f b-van+ f t-vdI, vveH, (2.12) 
Q Ox] Ox; Q Ip 





where w(x) := u (x,y) is the homogenized solution occurring in (2.6). 


3 Shape optimization by primal-dual methods 


Structural optimization has recently become of increasing interest in computer 
aided design and optimization of composite structures in materials science (cf., 
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e.g., [Ben95] and the references therein). A typical problem of structural opti- 
mization is to minimize a function (called objective, cost or criterion function) 
over a set of geometrical or behavioral requirements (called constraints). The 
set of structural parameters includes the so-called state and design param- 
eters, and the problem consists in computing optimal values of the design 
parameters, such that they minimize the specific objective function. Sizing, 
shape, and topology optimization problems are different types in structural 
optimization. Detailed classification of these problems is given, for instance, 
in [OT83]. In the sizing problems, the goal is to find the optimal thickness 
distribution of a given material structure. The main difficulty in shape op- 
timization problems arises from the fact that the geometry of a structure is 
a design variable which means, in particular, that the discretization model 
associated with the structure has to be changed in the process of optimiza- 
tion, see [All02, SZ92]. In the topology optimization of solid structures we are 
interested in the determination of the optimal placement of material in space, 
i.e., one has to determine which points of space are material and which points 
should remain void (no material). Hence, the main goal of these problems is 
to find the location of holes and the connectivity of the domain, see [BS03]. 

Our goal is to optimize mechanical performances of the ceramic composites 
described in Sect. 1 (such as the compliance or the bending strength) taking 
into account technological and problem specific constraints on the state and 
design parameters. Denote the state variables u = (u1,...,uw)’, which are 
the nodal values of the components of the discrete displacement vector, and 
the design variables œ = (a4,...,@,4)’ chosen as the microstructural data 
determining the geometry of the periodicity cell (widths and lengths of the 
different materials layers forming the cell walls, see Fig. 2.2). Since the geo- 
metrical properties of the final ceramics are not fixed but can be changed and 
precisely tuned within the processing, we focus on shape optimization of our 
microcellular SiC ceramic materials. Depending on the specific application, 
the objective functional J = J(u, aœ) can be chosen according to the following 
criteria: 


mechanical properties (minimum compliance), 
loading (bending, tension, torsion), 

thermal properties (shock resistance), 
technological properties (minimum weight), 
economical properties (cheapness). 


For simplicity, we consider the mean compliance of the structure defined 
as 


s(u.a) = f b-uan+ | t-udl, (3.1) 
R Ir 
Our shape optimization problem reads: Find (u, œ) E€ RN x R™ such that 


J(u, a) = int J(v, 8), (3.2) 


V, 
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subjected to the following equality and inequality constraints 
M 
A(a)u = f, gla) := 5 ai =C, Qmin Ê < Q < Qmax Ê, (3.3) 
i=1 


where J(u, œ) is defined by (3.1), A(q@) is the stiffness matrix corresponding to 
the homogenized equilibrium equation (2.12), u is the discrete homogenized 
displacement vector, f is the discrete load vector, and € = (1,1,...,1) € 
RM. Note that Amin and Qmax are technologically motivated lower and upper 
bounds for the design parameters. In the case of unit microstructure Y, we 
take the limits Qmin = 0, Qmax = 0.5, and 0 < C < 0.5. 


4 Primal—dual Newton interior—point method 


In the optimization algorithm, we are typically faced with constrained non- 
convex nonlinear minimization problems with both equality and inequality 
constraints on the state variables and design parameters. For the discretized 
optimization problem we use the primal-dual Newton interior-point meth- 
ods, recently a topic of intensive research [BHN99, ET*86, FGW02, GOW98, 
HPS02, HP04b, VS99]. The main idea of these methods is to generate it- 
eratively approximations of the solution which strictly satisfy the inequality 
constraints. Details are given in this section. 


4.1 General nonlinear optimization problem 


We consider the following general constrained nonlinear nonconvex program- 
ming problem with both equality and inequality constraints 


min, f(@), (4.1) 
subject to 
h(x) = 0, g(x) > 0, (4.2) 


where f : R” > R, h: R” > R™,m <n, and g : R” > R! are assumed to 
be twice Lipschitz continuously differentiable. Note that the constraints (4.2) 
have to be understood componentwise. 

The Lagrangian function associated with (4.1)-(4.2) is defined by 


L(a,y,2z) = f(x) + y* h(a) — z" g(a), (4.3) 


where y € R™ and z € R! are the Lagrange multipliers for the equality and 
inequality constraints, respectively. 

The first-order Karush-Kuhn-Tucker (KKT) necessary conditions for op- 
timality of (4.1)—-(4.2) read 
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Vael(z,y,z) =0, h(x) =0, g(x) > 0, Zg(x) =0, z 20, 


where 
l 


VæLl(æx,y,z)= V f(s) + 5 yiVhi(æ) — 5 ziVgi(æ) (4.4) 
i=1 i=1 
is the gradient of the Lagrangian function and Z is the diagonal matrix with 


a diagonal z. We also consider the Hessian of the Lagrangian with respect to 
x defined by 


m l 
V2L(a,y,z) = V? f(x) + > yiV7hi(a) — X` iV? gi(£), (4.5) 


i=l 


where V? f(a), V7hi(x), 1 <i < m, V7gi(x),1 <i <I stand for the Hessians 
of f(a), hi(a), and g;(x), respectively. Denote by 


A(z) = {i, gi(w) =0, i=1,..., 


the set of all indices for which the inequality constraints are equal to zero 
at x. We are interested in finding local minimizers of our optimization prob- 
lem (4.1)-(4.2). Assume that at least one such point x* exists satisfying the 
conditions: 


e Feasibility. h(x*) = 0 and g(a*) > 0. 

e Regularity. The set {Vhi(x*),...,VAm(a*)} U {Vgi(x*), i E€ A(a*)} of 
gradients of equality and active inequality constraints is linearly indepen- 
dent. 

e Smoothness. The Hessian matrices V? f(x), V?hi(x), 1 < i < m, and 
V7gi(@), 1 <i <1, exist and are locally Lipschitz continuous at x*. 

e Second-order sufficiency condition. n7 V2,£(x*)n > 0 for all vectors 
n # 0 satisfying Vh;(a*)'n = 0, 1 < i < m, and Vgi(æ*)Tn = 0, i € 
A(ax*). 

e Strict complementarity. z} > 0 if gj(a*)=0, 1<i<l. 


Well-known approaches from the optimization theory for handling prob- 
lems with inequality constraints are, for instance, the slack variable approach, 
the active set strategy, and the logarithmic barrier function approach. Each of 
these approaches results in a nonlinear programming problem with only equal- 
ity constraints. For example, in the first approach, the constraint g(x) > 0 
can be replaced by g(x) — s = 0,s > 0 by adding a nonnegative slack variable 
to each of the inequality constraints. Transformation of the original inequality 
problem into an equality one, by adding slacks, have been a frequently applied 
tool in scientific computations in the past twenty years and recently used in 
cf., [BHN99, ET*86, VS99]. The introduction of slack variables is associated 
with a small amount of additional work and storage, since they do not enter 
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the objective function and are constrained by simple bounds. The second, 
active set strategy, approach in nonlinear programming is directly related to 
the idea of the simplex method in linear programming. At each iterative step 
from this approach, applying, for example, Newton’s method, one has to define 
which constraints are active at the solution and treat them as equality con- 
straints by ignoring the others. The third approach was used in our practical 
implementations and we explain it in detail in the next subsection. 


4.2 Logarithmic barrier interior-point method 


The logarithmic barrier function method was first introduced in [Fri55] and 
later on popularized by [FM68] in the late sixties of the last century. The 
basic idea of this method is to replace the optimization problem (4.1)—(4.2) 
with the following equality constrained optimization problem 


i (p) 
wi, 8” (æ), (4.6) 
subject to 
h(x) =0, (4.7) 


where p is a positive scalar, called barrier parameter, and 


l 
BP (æ) = f(x) — p 2log gi(æ) (4.8) 


is often referred to as a barrier function. To insure existence of the logarithmic 
terms in (4.8) we implicitly require g;(a) > 0, 1 < i < l. In such a way, we 
get a family of subproblems depending on p for which it is well-known that 
under the assumptions conditions from Sect. 4.1 the solution of (4.6)-(4.7) 
converges to a solution of the original problem (4.1)-(4.2) as p decreases to 
zero (cf., [FM68]). This method obviously is an interior-point method since it 
keeps the sequence of iterating solutions strictly feasible with respect to the 
inequality constraints. Note that the logarithmic terms serve as a barrier and 
result in finding a solution æ”) such that g(a?) > 0. The solution points 
x‘) parameterized by p define the so-called central path or also called barrier 
trajectory. 
The gradient of (4.8) is given by 


l 
VA (x) = Vle) -X yale) 


and the Hessian of 39) (x) is defined by 


i 


l 
V28 (x) = V’ s(æ)-5 AG V*9i(@) +> Bray Me) g(a)”, (4.9) 
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The Lagrangian function associated with (4.6)—(4.7) is 
LO (x,y) = Bw) + y? h(a) -yio gil) + yTh(æ) 


and the gradient of L“) (a, y) with respect to æ is given by 


l 


VaL” (x,y) = V(x) -Y 


i=1 g(x) 





Voile )+ = yiV hax (4.10) 


The logarithmic barrier function method consists now of generating a se- 
quence of iterative solutions {a} = {a ?)}, local minimizers of the equality 
constrained subproblems, with p > 0 decreasing at each iteration. Taking into 
account the first-order optimality conditions and especially Val (a, y) = 0, 
we see that convergence of {a“°)} to an optimal solution æ* requires that 


A (p) __ . . P ys . 

lim y; =y, l<i<m and e O 7 TStSS, (4.11) 
where {y;} and {z7} are the Lagrange multipliers associated with the equal- 
ity and inequality constraints g;(a)) > 0, respectively. From g;(a)) — 0 
and the second relation in (4.11) we get p/g?(æ(®) — œo and hence, the Hes- 
sian of the logarithmic barrier function (4.9) would become arbitrarily large. 
Comparing now relations (4.4) and (4.10), we see that p/g;(a)) serves as a 
Lagrange multiplier for the inequality constraints. Thus, we can introduce an 
auxiliary variable z; = zP = plgi(x), 1 <i < l which can also be written 
in the form zi? ) gi(a)) = p. The last relation is usually called perturbed com- 
plementarity and can be used as a remedy, so that the differentiation will not 
create ill-conditioning. 

We formulate now the perturbed KKT conditions for the logarithmic bar- 
rier function problem (4.6)—(4.7), namely 


V f(z) + Vh(a)y — Vg(x)z =0, h(x) =0, Zg(x) = pē, g(x) > 0. (4.12) 


In matrix-vector notations, (4.12) results in the following nonlinear equation 
with n + m + l components 


t+Jiy -Jiz 
F®(x,y,z)=0 with F®(æ,y,z) = h , (4.13) 
Gz—pe 


where F®) = VL) is the gradient of the Lagrangian function with respect 
to x,y, and z; t = Vf(a) is the gradient of the objective function, Jeq is 
the Jacobian m x n matrix of the equality constraints h(x) = 0 and Jin 
is the Jacobian | x n matrix of the inequality constraints g(a) > 0. In the 
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last equation of (4.13) we have denoted G = diag(g;), gi > 0, 1 <i < land 
é = (1,1,...,1)”. Note that at each iteration we have three sets of unknowns: 
the primal variable x, the dual variable y, and the perturbed complementarity 
variable z which we consider independently. 

Denote the unknown solution by ® = (a#,y,z) and apply the New- 
ton method to the nonlinear system (4.13) to find the increments AS = 
(Az, Ay, Az), namely 

KAS = -FP (6), (4.14) 


which is often referred to as a primal-dual system. The vector A® is called 
search direction. The so-called primal-dual matrix K = (FOY (S) of second 
derivatives of the Lagrangian function is defined as follows 


H J-J 
K=| Ją 0 0 J, (4.15) 
Zin 0 G 


where the Hessian of the Lagrangian function H = V2ZL is given by (4.5). 
Note that the matrix K is sparse, nonsymmetric, independent of p, and usu- 
ally well-conditioned in a sense that its condition number is limited when 
p — 0 (see [Wri98] for more details). In the case of convex optimization (i.e., 
convex objective function f(x), linear equality constraints h(a), and concave 
inequality constraints g(x)), the Hessian matrix H is positive semidefinite. 
The properties of the Hessian matrix for inequality constrained optimization 
problem with logarithmic barrier function method are discussed in [FGW02]. 

One possible way for solving (4.14) is to symmetrize K taking into account 
the fact that Z and G are diagonal matrices. This method is proposed in 
[FGS96] and results in the following symmetric matrix 


i H Ji, -J 
I= Ją 0 0 ; 
-Jn 0 -ZIG 


which is strongly ill-conditioned with some diagonal elements becoming un- 
bounded as p — 0. In particular, for the active inequality constraints, the 
diagonal entries of Z~'G go to zero, and for the inactive constraints they go 
to infinity. As the iterates converge, the ill-conditioning of K increases, but 
it was shown in [FGS96] that the primal-dual solution of the optimization 
problem is actually independent of the size of the large diagonal elements and 
can be found by using, for instance, a symmetric indefinite factorization of 
the primal-dual system. 

Another alternative way for solving (4.14) which we use in our practical 
applications is to eliminate the (1,3) block of (4.15), i.e., due to g(a) > 0, we 
eliminate Az from the third equation of (4.14) 


Az = =z + G7! (pē — Z Jm Az) (4.16) 
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and replace it in the first equation. This method produces a symmetric linear 
system with n + m equations of the form 


H JEN (Aæ) __ (t+ Jay — pJinG E (4.17) 
Jeq 9 Ay] — h 


where H = H + JEG OZ Jin is often referred to as a condensed primal-dual 
Hessian and (4.17) is called a condensed primal-dual system. A detailed anal- 
ysis of the properties of the condensed primal-dual matrix can be found in 
[Wri98] where it was shown that the inherent ill-conditioning of the reduced 
primal-dual matrix is usually benign and does not influence the accuracy of 
the solution. 

Various methods for solving (4.17) and finding the primal-dual steps 
(Aa, Ay) are proposed in the literature (cg., e.g., [ET*86, GOW98, Wri98]). 
Note that one needs a reliable and efficient solver of (4.17), since the con- 
densed primal-dual system is solved at every iteration of the optimization 
loop. In practice, we apply transforming iterations (see [Wit89]) to find the 
increments. This method will be explained in more detail in Sect. 5. 

After finding the solution of (4.17), the algorithm proceeds iteratively from 
an initial point (a), y©, 2) through a sequence of points determined from 
the search directions described by (4.16) and (4.17) as follows 


htt) = ol) +a Ag, ytd = y® 4 aP Ay, zt) = 20h) 4 a Az. 
The parameters aP, aP aP € (0, 1] are called steplengths and their choice 


at each iteration is a critical feature of the algorithm to find a local minimizer 
of the optimization problem. 


4.3 Merit functions. Computing the steplengths 


In all optimization algorithms it is important to have a reasonable way of 
deciding whether the new iterate is better than the previous one, i.e., it is 
essential to measure appropriately the progress in finding a local solution. 
Merit functions of different types have been a subject of great interest over 
the past years (see, e.g., [ET*86, GOW98, Wri98]). The main idea of a merit 
function is to ensure simultaneously a progress toward a local minimizer and 
toward feasibility. The method of choosing a) at each iteration becomes 
more complicated in general nonlinear programming problems as it is well- 
known that the Newton method may diverge when the initial estimate of the 
solution is bad. 

Two versions of the Newton method can be applied, namely, the trust- 
region and the line-search approach. The first method has recently been ap- 
plied in, e.g., [BHN99]. Typical for this method is to find a step d*) which 
is restricted to a set, called the trust region. This set is practically obtained 
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by limiting |d™)| < r®, where r(*) is the trust region radius. At each it- 
eration, r‘*) is updated according to how successful the step has been. For 
instance, if the a priori chosen merit function M decreases, we accept the 
step ad), update the solution Bt) = 6 + aq and possibly increase the 
trust region radius r). Otherwise, we decrease r(*) by a damping factor, e.g., 
r(k) = r*) /2 and compute again the step d*), 

We apply the second variant of the Newton method, the line-search ap- 
proach. Once the solution A® of (4.14) has been determined, we find a 
steplength a) > 0 such that BY — 6 4 Qh) AB measuring a progress 
in minimization at each iteration and reducing the merit function in the sense 
M(@"*)) < M(®). The ideal value a) = 1 may not always happen 
so that various modifications of the basic Newton method have to be imple- 
mented. The following basic model algorithm can be considered: 


S1.If the conditions for convergence are satisfied, the algorithm terminates 
with 6") as the solution. 


S2. Compute a search direction A®™) solving (4.14). 

S3. Compute the steplength a) > 0 for which M(6™ + a” Ad™) < 
M(o), 

S4. Update the estimate for the minimum by Bt) — pH) tal) Ap), kis 
k+ 1, and go back to step S1. 


A standard convergence monitor in nonlinear programming is to choose 
the Euclidean norm || F®(æ,y,z)|| of the residual produced by the KKT 
conditions (4.13) as a merit function. However, in many practical implemen- 
tations, this choice of the merit function is not sufficient, since it does not 
allow to tell the difference between a local minimizer and a stationary non- 
minimizing point. The KKT conditions are necessary optimality conditions 
and hence, the optimization problem (4.1)-(4.2) and the nonlinear problem 
(4.13) are not equivalent, i.e., the Newton method may find solutions of (4.13) 
which do not minimize the objective function f(æ). Therefore, in order to 
find simultaneously solutions of both problems, a better approach is to rely 
on a hierarchy of two merit functions (cf., e.g., [GOW98, HPS02]). In general, 
the choice of merit functions in nonlinear constrained optimization problems 
is complicated. Several ideas have recently been proposed in the context of 
primal-dual interior methods (cf., e.g., [BHN99, ET*86, FGW02]). In particu- 
lar, our primary merit function is related to those suggested in [GOW98] and 
is chosen as a modified augmented Lagrangian incorporating the logarithmic 
barrier function (4.8) as follows 


l 
M := M(w,y,p,pa) = f(æ) -p Y log gile) + y7h(a) + 5pah(o)" We), 


(4.18) 
where p4 is a positive parameter. Our purpose now is to satisfy the descent 
conditions and to guarantee a reduction of the merit function in the sense that 
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each iterate should be an improved estimate of the solution of (4.6)—(4.7). Note 
that a descent is sought only with respect to a taking into account the original 
optimization problem. A standard way to achieve M(a + aAz,y,p,pa) < 
M(a, y, p, pa) is to require that Az is a descent direction, i.e., Ax? VaM < 0, 
where Vz is the gradient of the primary merit function with respect to the 
primal variable æ. In particular, we have 


Ag?’ VgM = Aa’ (t — pJ} G'E + J4 y + padgh) 
= Aa? (t — pJEGTte) — h"y — pah" h, (4.19) 

due to JegAx = —h from the second equation of (4.17). Hence, AxTV zM < 0 
can be satisfied if the augmented Lagrangian parameter p4 is sufficiently large, 
namely 

AxT (t — pJEG-!E) — hy 
an = 

h` h 

Hence, pa can be changed within the optimization loop, if Ag is not a descent 
direction. In our algorithm, we choose 


PA 


pa = min (rrt — pJ£GTtē) — hy), 100) (4.20) 


in the case Az? VgM > 0 and continue the loop (see [GOW98, HPS02] for 
details). 

For the secondary merit function we choose the la— norm of the resid- 
ual with respect to perturbed KKT-conditions (4.13). We apply the Newton 
method and choose the steplengths to strictly satisfy the inequality constraints 
g(x) > 0 and the complementarity constraints z > 0. Hence, the first require- 
ment for the line-search approach is to insure a strict feasibility. Let â and + 
be separate steplengths defined as follows 


â = max{alg(x) + aJin Ax > 0}, 4 = max{y|z+ yAz > 0}. 


Since we maintain interior (i.e., strict feasible) iterates, usually we take a 
parameter + € (0,1) bounded strongly away from unity and define a = 
min(1,7@) and y = min(1,77). We use the same steplength y for the La- 
grange multiplier y. In practice, both merit functions are used by means of 
the following strategy: If the steplengths a and y lead to a reduction of M, 
they are accepted. If M does not decrease, we check the secondary merit func- 
tion. If the latter decreases, the steplengths are accepted; otherwise damp the 
Newton steps by a certain factor and continue the procedure. The barrier pa- 
rameter p > 0 is updated by decreasing values until an approximate solution 
of the nonlinear problem is obtained (cf., e.g., [ET*86, GOW98, HPS02]). We 
rely on a watchdog strategy (see [CL*82]) to ensure progress in finding a local 
minimizer. If after some fixed number of iterations there is no reduction of 
M, the augmented Lagrangian parameter pa is chosen sufficiently large in 
accordance with (4.20). 
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5 Solving the condensed primal—dual system 


The discretized constrained optimization problem (3.2)—(3.3) is solved by the 
primal-dual interior-point method described in Sect. 4. We consider the diag- 
onal matrices Dı := diag(a;—Qmin) and Də := diag(Qmax — a;) and introduce 
Z= pDī'ē > 0 and w = pD; lē > 0 serving as perturbed complementarity. 
We note that 1 < i < N where N is the number of finite elements in the dis- 
cretized domain and ē = (1,1,...,1)" € RM. The primal-dual Newton-type 
interior-point method is applied to three sets of variables: primal feasibil- 
ity (u, œ), dual feasibility (A, 7), and perturbed complementarity related to 
(z,w). 
Denote the Lagrangian function of (3.2)—(3.3) by 


L(u,a;rA,;2,w) := f(u, a) (5.1) 
+ A*(A(a) u— f) +n (gla) — C) 
— zT (a — Amin €) — wT (Amax€ —a). 


The Newton method applied to the KKT conditions of (5.1) results in 


Lau Laa Lax Lan —-I I Aa Val 
Lyu La 9 0 0 OJJAA]T__ | Vy (5.2) 
nma 0 0 00 An V,£ |”? j 
0 Z 0 0 D, 0 Az Vz£ 
0 -W 0 0 0 D Aw Vwe 


where J stands for the identity matrix, Z = diag(z;) and W = diag(w;) are 
diagonal matrices. Following Sect. 4 we eliminate the increments for z and w 
from the 5th and 6th rows of (5.2), namely, 


Az = Dy\(-Vz£—Z Aa), Aw = Dy*(-VwlL+W Aa) (5.3) 


and substitute (5.3) in the second row of (5.2). We get the linear system 
KAw = —€ for the increments of p := (u,a,A,7), denoted by Ay := 
(Au, Aa, Ad, An) where K is the matrix and (—€) is the right-hand side of 
the following condensed primal-dual system 


Lau Laa Lay Lan Aa __| Val (5.4) 
Lyu LN 0 0 AA V£ ` 
0 La 0 0 An Vn£ 


The aa-entry of K and the modified entry for the right-hand side are 
aa = Laa + Dy BEDS WwW, Val=VaLlL+D'VzL- De Vl. 


Direct methods for the solution of (5.4) can be divided into two classes: 
range space methods and null space methods. These approaches essentially 
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differ in the grouping of the matrix into a 2 x 2-block structure. The de- 
composition of the condensed primal-dual system (5.4) is related to the first 
approach. In this section, we consider the null space decomposition of the con- 
densed primal-dual matrix interchanging the second and the third rows and 
columns. The resulting matrix can be written according to 


rs Ait A12 
K = = 
ee A22 
where the first diagonal block 


Ay = (a Saa) (5.5) 


Au 





is now an indefinite but nonsingular matrix. We remind that Ly,, = A(q@) is 
exactly the stiffness matrix corresponding to the equilibrium equation (2.12). 
Hence, Ava: exists, and the Schur complement S := Ag: — Ao Ay A12 is 
defined correctly. 

We use the following regular splitting of K 


KIK? = Mı- Mp (5.6) 


with left and right factors given below and reasonable matrices Mı and Ma ~ 
0. For solving the system of the form KA = -€ , Starting with an initial 
guess for Aq := (Au, AA, Aa, An)’, the transforming iteration proposed in 
[Wit89] is described by 





Apert) = Ap) + K®My'K*(-€- KAy™), (5.7) 


where the new iterate '"°w? 


namely 


is obtained by a line-search in the direction Aw, 
new old A 

pe) = Yl") talap) 1<j<4 

The line-search approach and the choice of the steplengths parameters a; are 

discussed in Sect. 4.3. 


Using an appropriate preconditioner for the stiffness matrix, we approxi- 
mate the first diagonal block (5.5) as follows 


0 L£ 0 Ê 7 
Au = uà) z uà | =: Ay. 
n= (QB) Na 


Usually, the left and right transformations are of the form 


I0 ~£yulra 0 
Rear ade T ie 01 -L Lua 0 
0 I 00 = 0 

00 0 I 
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In this case, the regular splitting (5.6) becomes KKE = Mı — M2 where 


0 Ly x 0 0 


Lyu 0 0 0 Au 4) 
Mı = u x = 5.8 
| Lau Laa 5S Lan Gy. (5:8) 
0 0 La 0 
and 7 
00 Lua — Luz Eu Lua 0 
R—1 
M = | 99 £ra 7 EAuE zua 9 |. (5.9) 
00 0 0 
00 0 0 


Note that Mə ~ 0 if we have a good preconditioner for the stiffness matrix. 
In our numerical experiments, we choose a Cholesky decomposition of Lyu) 
The second diagonal block Q in (5.8) is symmetric and indefinite given by 


S£ i j= £ ie a 
Q:= € > =) with S= Laa-Lauly yu Lia La u Lua 
n 


We denote the defect in (5.7) by d = =f= Kay” and compute the corre- 
sponding entries 

du = —Vul = Lug Aà = Luada, 

dy = -VL = LyyAu — Lyjaa, 

da = —Val — Lau^u — Laz AÀ — aa Aa — LanAn, 

dy = —Vn£ — Lya ^a. 
Taking into account (5.7) one needs to compute 6 = M,'d, i.e., M1ô = d. 
Consequently, we find 6, = L pdu and ðu = Ludy- To compute the 


remaining components of 6 we have to solve systems with an indefinite matrix 
Q of the form 


S Lan \ (a _ (da — Laudu — La 5p 
Lna 0 Ön dn l 
Iterative procedures such as MINRES or Bi-CGSTAB (see [VdV92]) with 
appropriate stopping criteria can be applied in this case. Compute KEÔ and 
find the increments from (5.7) as follows 
Aute) = Au + bu — Ly La Sa, Aa) = Aad + bar, 
ANP) = ANS) + 6. — Loy Luada, Anew) = An) + 6, 








We apply the above algorithm (with a fixed number of iterations) to find 
the increments of the primal and dual variables Au, Aa, AA, An and then 
use (5.3) to determine the global search direction A®. 


414 R.H.W. Hoppe and S.I. Petrova 


6 Adaptive grid refinement 


Advanced finite element applications in science and engineering provoke the 
extensive use of adaptive mesh-refinement techniques to optimize the number 
of degrees of freedom and obtain accurate enough numerical solutions. The 
adaptive framework requires a locally refined discretization in regions where 
a better accuracy is necessary. 

The computation of the homogenized elasticity coefficients requires the nu- 
merical solution of (2.8) with the unit cell as the computational domain. Previ- 
ous works on shape and topology optimization (cf., e.g., [Ben95, BS03, SZ92]) 
strongly suggest the use of locally refined grids particularly at material inter- 
faces. In the context of shape optimization such local refinements have been 
mostly done before the computations relying on a priori geometric informa- 
tions or in an interactive way (manual remeshing based on computational 
results). In case of local singularities of the discrete solution, a priori error 
estimates typically give information about the asymptotic error behavior and 
thus, are not the best choice to control the mesh. In those parts of the do- 
main where the solution changes rapidly, an automatic grid refinement on the 
basis of reliable and robust a posteriori error estimators is highly beneficial. 
In practice, the main goal in adaptive mesh-refinement procedures is to refine 
the mesh so that the discretization error is within the prescribed tolerance 
and as possible equidistributed throughout the domain. 

In the past twenty years, numerous studies have been devoted to an er- 
ror control and a mesh-design based on efficient postprocessing procedures 
(cf., e.g., [CF01, EE*95, HP04a, ZZ87]). A natural requirement for a poste- 
riori error estimates is to be less expensive than the cost of the numerically 
computed solution. Moreover, appropriate refinement techniques have to be 
applied to construct the adaptive mesh and implement the adaptive solver. 
Local reconstruction of the grid is necessary to be done with a computational 
cost proportional to the number of modified elements. 

The a posteriori adaptive strategy can be described as follows: 


Al. Start with an initial coarse mesh Jọ fitting the domain geometry. Set 
n :=0. 

A2. Compute the discrete solution on Th. 

A3. Use a posteriori error indicator for each element T € Jn. 

A4. If the global error is small enough, then stop. Else refine the marked 
elements, construct the next mesh 7,41, set n := n+1, and go to step A2. 


The solution of our linear elasticity equation (2.8) is computed by using 
adaptive finite element method based on the Zienkiewicz-Zhu (referred as ZZ) 
error estimator. For instance, a recovery technique is analyzed in [ZZ87] for 
determining the derivatives (stresses) of the finite element solutions at nodes. 
The main idea of the recovery technique is to develop smoothing procedures 
which recover more accurate nodal values of derivatives from the original finite 
element solution. 
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The necessity of derivative recovering arises from the fact that in the finite 
element approach the rate of convergence of the derivatives is usually one 
order less than that of the discrete solution. In particular, the accuracy of the 
derivatives (stresses) computed by directly differentiating the discrete solution 
is inferior. Therefore, in many practical problems an improved accuracy of the 
stresses at nodes is needed. 

Denote by ø the exact stress, by & the discrete finite element discontinuous 
stress, and by o* the smoothed continuous recovered stress. The computation 
of o* was proposed and discussed in [ZZ87] under the assumption that the 
same basis functions for interpolation of stresses are used as those for the 
displacements. The recovered stress o* is computed by smoothing the discon- 
tinuous (over the elements) numerical stress 6. The smoothing procedure can 
be accomplished by nodal averaging method or the L2-projection technique. 
Note that the components of o* are piecewise linear and continuous. 

The computational of the global L2-projection is expensive and the authors 
of [ZZ87] proposed to use a lumping form of the mass matrix. Thus, the value 
of the recovered stress o* at a node P can be computed by averaging the 
stresses ĝ at the elements that share that node. Denote by Yp C Y the 
neighborhood patch as an union of all triangles/tetrahedra T having node P. 
Consider 7 

o*(P)= So wlrêlr, wlr = AN T € Yp, (6.1) 
TEYp [YP 
i.e., o*(P) is a weighted average of 6 with weights w|r defined on the elements 
belonging to Yp. Least-square technique can also be applied to approximate 
the stress field at a given node. 

It was shown in [ZZ87] that o* is a better approximation to ø than & and 

the following estimate holds 


lo -o"lloy < |lo-Flloy, (6.2) 


where Y is the periodicity microcell into consideration. Furthermore, the re- 
covered technique was used in a formulation of a posteriori error estimator 
by comparing the recovered solution o* with the finite element solution &. In 
particular, the estimate (6.2) allows us to replace the exact (unknown) stress 
o by o* and consider ||o* — ô||o,y as an error estimator. 

In many practical implementations reliability and efficiency are highly de- 
sirable properties in a posteriori error estimation. It basically means that 
there exist constants independent of the discrete solution and the mesh which 
limit the error (in a suitable norm) from below and above. Moreover, techni- 
cally it is better to use local error estimators which are computationally less 
expensive. The following local estimator is considered 


nr := llo” — 6llo,r- (6.3) 


The nodal values of the recovered stresses are found locally. The elementwise 
contributions (6.3) are used further as local error indicators in the adaptive 
mesh-refinement procedure. 
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The global ZZ-error estimator is defined by 


w= (Eren A) oa 


Based on a posteriori processing, the local estimator (6.3) is practically 
efficient providing recovered values are more accurate, i.e., the quality of the 
a posteriori error estimator strongly depends on the approximation properties 
and the accuracy of the recovered solution. 

Arbitrary averaging techniques in low order finite element applications for 
elasticity problems are subject of investigations in [CF01]. In the latter study 
the authors considered the following global averaging estimator 


NA T= min |lo*—6lloy (6.5) 


and proved an equivalence to the error ||o — @||o,y with lower and upper 
bounds independent of the shape-regular mesh. Note that in (6.5) o* is a 
smoother approximation to 6 obtained by any averaging procedure. In partic- 
ular, the final error estimate in [CF01] explains the reliability and robustness 
of the ZZ- a posteriori error estimators in practice. 


7 Iterative solution techniques 


In this section, we comment on the iterative solvers for the microcell problem 
(2.8) defined in Y to find the effective coefficients and for the homogenized 
elasticity equation (2.12) on the global domain (2. After finite element dis- 
cretization of the corresponding domain we get the following system of linear 
equations 

Au= f, (7.1) 


where u is the vector of unknown displacements and f is the discrete right- 
hand side. The stiffness matrix A is symmetric and positive definite but not 
an M-matrix. 

Two typical orderings of the unknowns are often used in practice. In the 
3-dimensional case they are presented as follows 


(ul ul), uf, uP, ul), ul, e ue), u), ul?) ; (7.2) 


referred to as a pointwise displacements ordering and 


(ar uP, P ue), ul), ul), ies UW), ul, ul, sian ul) ; (7.3) 


called the separate displacements ordering. Here, u”, wu), and uf? are the 
corresponding x, y-, and z- displacement components. For the the first or- 
dering (7.2), the resulting stiffness matrix A = A") can be seen as a 
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discretization matrix consisting of elements which are small 3 x 3 blocks. For 
the second ordering (7.3), the matrix A = A(°c*) admits the following 3 x 3 
block decomposition 
Au A12 Ais 
A = Ag Aoo Ao3 . (7.4) 
A31 A32 Ass 


In case of isotropic materials, the diagonal blocks A;;, j = 1,2,3, in (7.4) 
are discrete analogs of the following anisotropic Laplacian operators 


kad — oe? oe? 82 cad — 82 82 oe? cad = oe? oe? 82 
Di=ağz + bag tbgz D2=b37 + age + baz, Da=bae + biz t037 


with coefficients a = E(1—v)/((1+v)(1—2v)) and b = 0.5E/(1 +v) where E 
is the Young modulus and v is the Poisson ratio of the corresponding material. 
This anisotropy requires a special care to construct an efficient preconditioner 
for the iterative solution method. Based on Korn’s inequality, it can be shown 
that A and its block diagonal part are spectrally equivalent. The condition 
number of the preconditioned system depends on the Poisson ratio v of the 
materials and the constant in the Korn inequality. For the background of the 
spectral equivalence approach using block diagonal displacement decomposi- 
tion preconditioners in linear elasticity problems we refer to [BLA94]. Note 
that the spectral equivalence estimate will deteriorate for v close to 0.5 which 
is not the case in our particular applications. 

The PCG method is applied to solve the linear system (7.1). We propose 
two approaches to construct a preconditioner for A: 

(i) construct a preconditioner for A’) 

(ii) construct a preconditioner for A(°’°°*) of the type M = diag(Mj,), 
where Mj; ~ Ajj, j = 1,2,3, are “good” approximations to the diagonal 
blocks of A. In case (i) we have chosen the incomplete Cholesky (IC) factor- 
ization of A with an appropriate stopping criterion. 

An efficient preconditioner for A,; in case (ii) turns out to be a matrix 
Mj; corresponding to a Laplacian operator (—div (c grad u)) with a fixed scale 
factor c. In our case we use, for instance, c = b/2 for all three diagonal blocks. 
Algebraic MultiGrid (AMG) method is applied as a “plug-in” solver for A 
(see [RS86] for details). This method is a purely matrix-based version of the 
algebraic multilevel approach and has shown in the last decade numerous 
efficient implementations in solving large sparse unstructured linear systems 
of equations without any geometric background. 





8 Numerical experiments 


In this section, we comment on some computational results concerning the 
microscopic problem to find the homogenized elasticity coefficients and the 
macroscopic shape optimization problem. For simplicity, we suppose linear 
elasticity with homogeneous and isotropic constituents in terms of carbon 
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and SiC. The Young modulus E (in GPa) and the Poisson ratio v of our two 
materials are, respectively, Æ = 10, v = 0.22 for carbon and E = 410, v = 
0.14 for SiC. 

The computation of the characteristic displacement fields EM and the ho- 
mogenized elasticity coefficients (2.9) requires the solution of linear elastic 
boundary value problems with the periodicity cell Y as the computational 
domain. The elasticity equation (2.8) is solved numerically using a conform- 
ing finite element discretization of the periodicity cell Y by linear basis func- 
tions. Since the periodic displacements €" = €'* are symmetric, the equation 
(2.8) is computed numerically 3 times in the case d = 2 and respectively, 
6 times in the case d = 3. Due to the composite character of our microcell 
there are material interfaces where the solution changes significantly. Hence, 
local refinement of the underlying finite element mesh is strongly advised. As 
discussed in Sect. 6, we use an adaptive grid refinement strategy based on a 
posteriori error estimator of Zienkiewicz-Zhu type obtained by local averaging 
of the computed stress tensor. Note that the adaptivity procedure is local and 
computationally cheap. 

Denote the global density of the solid material part in the microstructure 
by u, 0 < u < 1. Note that the density of the tracheidal cells of the wood 
essentially depends on the growth of the tree. If u is relatively small, we speak 
about an early wood (grown in spring and summer) and respectively, about 
late wood (grown in autumn and winter) for values of u, close to 1. 

We present first some numerical experiments on a plane microstructure 
(d = 2) shown in Fig. 2.2 b). More experiments can be found in [HP04a]. We 
assume that the material layers in the periodicity cell have equal widths from 
all sides of the cell. Denote by a;, i = 1,2, the widths of the carbon and SiC 
layers, respectively. Figure 8.1 a) illustrates the behavior of the homogenized 
coefficient EB in case of square hole versus a1 and œz which vary between 0 
and 0.5. We compute the effective coefficients Ey only for a fixed number of 
values of the design parameters (e.g., 20 x 20 grid as shown on Fig. 8.1) and 
then interpolate the values by splines. With regard to the homogenized state 
equation (2.12), this procedure results in having explicit formulas at hand 
for the gradients and the Hessian of the Lagrangian function needed in the 
optimization procedure. 

In principal, the hole is located inside the microstructure but we find 
interesting to demonstrate the behavior, for instance, of E/4,. depending on 
a rectangular hole [1 — a] x [1 — b], see Fig. 8.1 b). Note that a = b = 0 
represents a complete void, a = b = 1 realizes a complete solid material, and 
0<a<1, 0<b< 1 characterize a general porous material. We consider in 
this example the case when the carbon has completely reacted with the SiC 
which strongly concerns the so-called pure biomorphic SiC-ceramics. Very 
recently, the chemical experiments have shown that the carbon phase limits 
the mechanical properties of the composite materials and restricts their high- 
temperature applications. The final transformation of the original carbonized 
template to pure ceramic composite requires to offer enough silicon during 
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Fig. 8.1. Homogenized coefficient E7519: a) w.r.t. the widths of carbon and SiC 
layers (square hole); b) w.r.t. the sizes 1 — a and 1 — b of the rectangular hole (pure 
SiC-ceramic) 


the infiltration process and to wait an appropriate time until the carbon is 
completely consumed by the silicon resulting in a SiC-phase. 

Figure 8.2 displays the dependence of the homogenized elasticity coeffi- 
cients on the density u of the cell. In particular, we show this behavior versus 
the width of the SiC layer in case of pure SiC-ceramics. Figure 8.2 a) shows the 
behavior of the effective coefficients for early wood (0 < ag < 0.15, u = 51%) 
and Fig. 8.2 b) demonstrates the coefficients for late wood (0 < az < 0.3, 
u = 84%). One can easily observe from both pictures on this figure a highly 
nonlinear behavior of the homogenized coefficients. 

The mesh-adaptive process is visualized in Fig. 8.3. We see that in case 
of one material available in the microstructure, an appropriate refinement is 
done around the corners where the hole with a complete pore is located, see 
Fig. 8.3 a). In case of more materials, additional mesh-adaptivity is needed 
across the material interfaces in the microstructure due to the strongly varying 
material properties in terms of Young’s modulus and Poisson’s ratio. 

In Table 8.1 we give some results for the homogenized elasticity coefficients 
on the first ten adaptive refinement levels for various values of the density. We 
report the number of triangles NT and the number of nodes NN on each level 
when solving problem (2.8). We see from the computed values that the mesh 
sensitivity on the successive levels is very small. Our adaptive mesh-refinement 
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Fig. 8.2. Homogenized coefficients w.r.t. the width a2 of SiC layer for pure SiC- 
ceramic: a) early wood, density 51%; b) late wood, density 84% 
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Fig. 8.3. Late wood, density = 84%, 9 adaptive refinement levels: a) SiC, 1527 
triangles, 818 nodes; b) carbon and SiC, 3752 triangles, 1916 nodes 


procedure stops when a priori given limit for the number of refinement levels 
is reached. 

We are now concerned with the solution of problem (3.2)—(3.3). In Ta- 
ble 8.2 we report some numerical results from running the optimization code 
varying the constant C with respect to (3.3). Our purpose is to find the op- 
timal widths/lengths of the layers in the composite material and to show the 
convergence behavior of the optimization algorithm. The domain (2 is chosen 
to be a circle which corresponds naturally to a cross section of the original 
wood structure. We have fixed the discretization and vary the initial values 
for the lengths of the carbon and SiC layers denoted, respectively, by a and 
a?) As before, we report the number of iterations ITER to get convergence, 
the optimal lengths a; and az of the carbon and SiC layers, the last value of 
the barrier parameter p, the final value of the primary merit function M, the 
lg-norm of the residual, and the l2-norm of the complementarity conditions 
v = (z,w) at the last iteration. We see from the experiments that the optimal 


Table 8.1. Homogenized coefficients w.r.t. refinement level, a) y = 51%, b) u = 84% 


| level | Effi Eiiz2 Eiz | NT | NN | | level | Efi Efize Efgio| NT | NN | 
1 |64.975 7.664 12.116] 168 | 100 1 [33.430 3.885 9.893] 168 | 100 
2 |63.336 6.642 9.750 | 220 | 126 2 |33.064 3.929 9.577| 216 | 126 
3 [58.466 6.682 8.073 | 288 | 162 3 [32.844 4.024 9.283] 300 | 168 
4 56.572 7.012 6.643 | 484 | 262 4 |32.291 4.254 8.970| 544 | 296 
5 [54.385 6.245 6.212 | 712 | 378 5 [32.144 4.312 8.809| 828 | 438 
6 |52.936 6.091 5.474 |1208] 630 6 |31.909 4.372 8.703]1354| 705 
7 |51.914 5.458 5.306 |1800| 932 7 |31.862 4.379 8.526/1892] 980 
8 |50.861 4.790 5.217 |2809]1444 8 |31.735 4.399 8.470]2894|1485 
9 150.455 4.571 5.029 |3754]1919 9 |31.711 4.400 8.373]3752|1916 
10 |49.591 4.359 4.983 |5918|3013 10 |31.487 4.497 8.321]5716]2906 
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Table 8.2. Convergence results for biomorphic microcellular SiC ceramics 
0)| (0 
lar a | C |ITER| a far] o |M [IF|2[lloK2| 


0.05|0.05| 0.3 | 11 |3.6e-12| 0.3 |1.3e-17|1.24| 9.63e-6 | e-10 
0.1 | 0.1 | 0.3 | 11 |5.5e-14| 0.3 |3.0e-21|1.24| 1.03e-6 | e-12 
0.1 | 0.1 | 0.4 | 12 |1.6e-16| 0.4 |1.2e-26|0.85| 8.63e-9 | e-14 
0.2 | 0.2 | 0.1 | 16 |5.5e-17| 0.1 |2.2e-25|7.73| 2.23e-8 | e-13 
0.2 | 0.2 | 0.2 | 13 |1.0e-16| 0.2 |5.3e-26|2.34| 1.54e-8 | e-14 
0.2 | 0.2 | 0.3 | 11 |2.5e-16| 0.3 |6.7e-26|1.24| 1.79e-8 | e-14 
0.24|0.24|0.15| 11 |5.4e-15|0.15|4.1e-12[3.81| 4.99e-7 | e-12 
0.3 | 0.1 | 0.4 | 11 |1.3e-12| 0.4 |8.5e-19|0.85| 5.07e-6 | e-10 
0.4 |0.05| 0.1 | 17 |9.8e-15| 0.1 |6.9e-21|7.73| 9.49e-7 | e-11 


length a, of the carbon layer in all the runs is very close to zero, i.e., the solid 
part of the body is entirely occupied by a silicon carbide layer due to the 
higher stiffness of this material. 

In case of 3-dimensional implementations we decompose the periodic mi- 
crocell Y first in hexahedra and further we use continuous, piecewise linear 
finite elements on tetrahedral shape regular meshes. The adaptive grid refine- 
ment process is visualized in Fig. 8.4. The mesh adaptivity around the material 
interfaces has been realized by means of Zienkiewicz-Zhu type a posteriori er- 
ror which is used heuristically (as an error indicator). One computes the error 
(6.3) locally for each element and mark for refinement those tetrahedra {T} 
for which 

nT y a NTE 


where 0 < y < 1 is a prescribed threshold, for instance, y = 0.5. The refine- 
ment process is visualized in Fig. 8.4 b) on the cross section of the microstruc- 
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Fig. 8.4. Adaptive refinement a) 3-D unit periodicity cell Y, b) Cross section of Y 
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Table 8.3. Homogenized coefficients for late wood, density u = 91% 


H H H H H H 
level] Eti11 | E2222 | £3333 |E1212| £2303 | £1313 


148.35]152.57] 153.96] 60.22 |62.46 |59.50 
154.34] 162.64] 162.77]69.71|71.31]65.79 
142.66} 148.42]162.79] 60.51 | 65.26 | 63.23 
145.84} 137.61]161.70]53.91|59.04|62.92 
127.99} 134.32]161.43] 49.41 ]56.19]56.49 
98.29 |111.65]160.71) 40.44] 46.14] 48.45 
91.79 | 90.23 |158.29] 35.70 | 43.69 | 46.03 
82.42 | 83.00 |160.57) 30.59 | 41.03 | 43.70 
79.05 | 75.11 |160.22) 26.93] 39.75 | 40.97 
69.66 | 70.30 |159.82] 25.47 37.16 |39.30 


CONDE WNEH 


= 
© 


ture Y for widths of the C- and SiC- layers ay = ag = 0.15. Additional adap- 
tive refinement is generated in the stiffer material (SiC) and on the interface 
between the materials due to the different characteristic constants. 

In Table 8.3 we report some values of the computed 3-dimensional ho- 
mogenized coefficients with respect to the adaptive refinement level for a late 
wood with density u = 91%. More numerical experiments for various values 
of the density and various number of adaptive levels can be found in [HP06a]. 

Table 8.4 presents some convergence results for the proposed precondition- 
ers within PCG method. For various values of the density u of the periodical 
microstructure we report the number of degrees of freedom NDOF, the num- 
ber of iterations ITER, and the CPU-time in seconds for the first 11 adaptive 
refinement levels. One can see from the numerical results a better convergence 
of AMG-preconditioner compared to IC-factorization. We observe an essential 
efficiency of AMG for a larger number of unknowns. 


Table 8.4. Convergence results with IC and AMG preconditioners, density p 


density 2 4 
2 


5 | 6 
NDOF] 78 | 90 | 126 |225|336| 579 
ITER] 9 | 8 | 14 | 23] 40] 66 
CPU |e-16}e-16]}e-16]0.1]0.2] 0.2 
ITER | 11 | 13 | 13 | 15|18 | 23 
CPU ļe-16ļe-16ļe-16|0.210.3| 0.5 





CPU |e-16/e-16] 0.1 ]0.1]0.1] 0.6 | 2.4 | 8.4 | 24.3] 63.7 | 187.1 
AMG |ITER] 12 | 14 | 14 | 14| 18 | 31 | 43 | 73 | 69 | 74 75 
CPU je-16ļe-16ļe-16|0.2|0.4| 1.1 | 3 | 7.5 |15.5| 25.6 | 33.8 
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Summary. We discuss two different approaches related to I’-limits of free energy 
functionals. The first gives an example of how symmetry breaking may occur on the 
atomistic level, the second aims at deriving a general analytic theory for elasticity 
on the lattice scale that does not depend on an explicitly chosen reference system. 


1 Introduction 


The analysis of the mechanical properties of crystals gives rise to internal ener- 
gies that are connected to the geometry of the considered crystal and are often 
linked to properties of the atomistic scale as explained in [CK88] and [JF00]. 
Applications to this theory include among others fatigue phenomena and frac- 
ture mechanics. In the past, various attempts were made to develop a math- 
ematically rigid theory. In particular we want to mention [Bal77], [CLL98], 
[AO05], [OP 99], [FT02] and [Tru96]. Nevertheless, up to now, the relationship 
between macroscopic and atomistic scale is not completely understood. 

Here we contribute to this topic. The text is subdivided into two parts. The 
first gives a simple example where symmetry breaking occurs in the T -limit of 
a one-dimensional monatomic chain when the interatomic distance vanishes. 
Effects similar to the one presented in this first part may also show to be 
relevant for numerical approximation schemes where in certain cases a com- 
petition between elastic energy and surface energy leads to wrong numerical 
solutions, see [Ble06]. 

The second line of investigation is the use of many-body Hamiltonians 
of Kac type to describe elastic deformations, phase changes and eventually 
plastic deformations of a domain 22 C R” without postulating a reference 
configuration on the particle level. It is interesting to compare this ansatz to 
[AO05], where a theory based on algebraic topology is developed. 
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One of the aims is to make a connection with the existing theory on linear 
elastic dislocations, see [TK76], [Mer79], [CC*97], [BC05]. 


2 Phase transitions with symmetry breaking 


2.1 The energy functional 


For given length L > 0, let 2 := (0,L) C R be a domain that contains a 
regular monatomic chain. 

We suppose that the undeformed discrete reference configuration of 2 is 
given by a system of n+1 atoms with equal distance located at points R? € R, 


Rp :=ih” OSic<n. 


Here, the setting h” := L/n defines for given number n € N the interatomic 
distance. The limit n — oo corresponds to h” N 0. The superscript n is 
always used to indicate the dependence on the number of subdivisions. 

By Re, 0 <i <n we denote the position of atom i after the deformation. 
Finally, ie ur, 0 <i<n we denote the two-dimensional displacement vector 
of atom i, given by the relationship 
u? = RP —R", O<iK<n. 


u 


For given deformations {u?}o<i<n we introduce the abbreviations 


n 
n , Ui} 
Pi := Foe 


and for shortness the numbers sı := 1, s2 := 2 and s3 := 4. 
We will study the behaviour of the following energy functional. 
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Here, 0 < a < 1 and [2], = z for x > 0 and [a], = 0 for x < 0. 

The concept behind this ansatz is the following. A minimiser of WP either 
fulfils p?., ~ p? which specifies one lattice order that is in the sequel referred 
to as Phase 1, or př, = 2p? resp. piyi = spr which characterises Phase 2. 

W% represents a surface energy. It counts(and limits) the number of tran- 
sitions between the two phases, as within a phase one asymptotically has 
Pipo = p;. Finally, W} represents an elastic energy. We will show below that 
BP converges in L1(Q2) to the indicator function of Phase 1 and y? to the 
indicator function of Phase 2 as n — oo; ax is the elastic constant to Phase k. 

The functional WF represents the electrostatic energy due to interatomic 
potentials that force the atoms to positions of a certain given lattice order. 

For the analysis we extend the discrete deformation values {u7 }, to piece- 
wise linear functions u” in L?(Q)M.A", where A” denotes the space of piece- 
wise linear functions, see [BDG99]. 


2.2 Identification of the F-limit for W” 


Now we can state the main result. It characterises the T ~-limit of w” as n tends 
to infinity. Let xı := X, X2 := 1 — x. For u € H'?(2), x € BV(Q, {0, 1}) set 


Buw SAE f xe e- a) 
Q 


k=1 ù 


Additionally we introduce W : L?(2) — R by 


inf e gv (7,{0,1}) E(u X) if u € H'*({) is strictly monotone, 
+oo else. 


W(u) := { 


Theorem 2.1 (Characterisation of the T-limit of W”). 
The following statements are valid: 
(i) The boundedness of the energy functional W"(u") implies the boundedness 


of (J (un?) uniformly in n. 
Q n 
(ii) W is the T -limit of W” as n — co with respect to convergence in L?(Q). 
Proof of (i): 
Step 1: Construction of the characteristic function x: 


By C we denote various positive constants that may change from line to line. 
Let (u”) C L?(N) be a sequence with W"(u") < C. We set 


í Pipi 
to 
dj, = 








— si, kå = argmin{ k > di, | Laks 3}. 


The boundedness of W? (u”) implies 
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n-1 3 n 2 
Donl o 
i=0 k=1 Pi 
Therefore there exists a constant C > 0 such that 


sup dy: < CRJE, (2.1) 


For n large enough we can thus define an indicator function x” to Phase 1 by 
0 if z € [ih", (i +1)h"),i<n—-2, ki Al, 


x" (2) = 1 if z € fih”, (i +1)h”), i<n-2, kh =1, 
x"(L — 2h”) if x € [L — h”, L]. 





Next we show that x” € BV (Q; {0,1}), i.e. 


five 20. (2.2) 
Q 


This follows from the boundedness of W3 (u”). Since for large n 


Pa = sk +0(1) for some k € {1,2,3}, 


we see that if x” (x) jumps in z = (i + 1)h” between 0 and 1, then 
Pito ) 21 
pote) > 2 +o(1 
( Ta A 
which shows W% (u”) > (3+0(1)) Jo |Vx"| and proves (2.2). Here we adapted 
the Landau notation and denote by o(1) terms that tend to 0 as n > co. 
With (2.2), well-known compactness results imply the existence of a subse- 


quence (again denoted by) x” and a x € BV(Q, {0,1}) such that x” — x in 
LHA). 


Step 2: Convergence of 8”, y” in LH(Q): 
We extend the discrete quantities {87 4, {77 } to piecewise constant func- 
tions in L1(Q) by the definition 


a(x) = {OF Hee lih”, (i+ Yh") and i< n2, 
T=] 0 Aes Ae, 


In the same manner, the extension y” of {7;"}; is defined. 
Straightforward computations show 


Bray, v4 (l-x) in LN) forn > 00, (2.3) 
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where the function x € BV(Q, {0,1}) is the limit of x” found in Step 1. 
Step 3: Boundedness of f |(u”)’|? uniformly in n: 
We choose constants i € Rt, b € R such that 
min{(x — a4)", (x — a2)*} > ax? — b. 


Due to the boundedness of W3(u”) we thus find that there exist constants 
Ch, Cz > 0 such that 


n—2 n n 
Cr > (o Y (E (gp +a). 
1=0 


Since pf, = spp? + o(1) for a k € {1,2,3} and large n we find that 


Batna Gastro) (E) 
Piti T Pi ie et ENV. 
( 2 ZES NS 


The term (8? + y7) can for large n be estimated from below by a constant. 
So we find the existence of a constant C > 0 with 


n—2 
ny\2 pi : 
Gate) Se) (2.4) 
Due to the estimate (p"_,)? < (2+0(1))p"_» the sum in (2.4) can be extended 
to i = n — 1 and the estimate still holds. 
The sum )>,(p/)? is directly related to fo |(u”)’|? where u” is the piecewise 
affine linear extension of {u?’};. With (2.4) extended to i = n — 1 this yields 


n-1 


sup f \(u" = sup h” (ph)? < C. (2.5) 
Q 


i=0 


Proof of (ii): 
Step 4: Lower semicontinuity of W”: 

We have to show: for every sequence (u”)nen with u” — u in L?(Q) there 
exists a subsequence (u”*),en with 


W(u) < lim inf W"*(u"*). 


For unbounded W” (u”) there is nothing to show. So let W"(u”) < C for all 
n. From (2.5) follows u”, u € H!?(Q) for all n € N. Because of the reflexivity 
of the Hilbert space H!:?(Q) we know that there exists a subsequence (again 
denoted by) u” such that u” — u, in H!?(Q) for n — oo. From Step 2 we 
know that x” — x, 8" > x, Y” — 1 — x in L1(2) for n — oo. Because of 


oo > $ + 0(1), for n > no we find that u” is monotone for large n. 
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Now we estimate W” (u”) from below. We claim 


liminf W"(u") > E(u, x) > W(u). (2.6) 
With the help of Theorem 3.4, p.74 in [Dac89], the proof of (2.6) is straight- 
forward, estimating every component of W” (u”) separately. 


Step 5: Existence of a ”recovery sequence”: 
We have to find a sequence (u”) C L?(Q2) converging to u in L?(Q2) with 


W(u) > lim sup W” (u”). 
n—>oo 

If W (u) = +00, there is nothing to show. Due to the monotonicity prop- 
erties of u demonstrated above we know that the functional y + E(u, x) is 
bounded from below in the BV-norm. Using the compactness properties of 
BV (2) and the coercivity of E, it is clear that E(u,-) attains its minimum, 
ie. W (u) = E(u, X) for some x € BV (9, {0, 1}). 

Next we show that for piecewise affine, strictly monotone u there exists 
a sequence u” with u” > u and W” (u”) > E(u, x). We start with special 
cases, then generalise. 
Case 1: u’ = a, > 0, x = const in 22: 
(a) x =1 in 2: We simply set u” := u for all n. 
(b) x= 0 in 2: For x > 0 choose u” such that p? is alternating between 
and $a;. Furthermore u” satisfies u” (x = 0) = u(x = 0). 


2 


31 


Case 2: uw =a, >0,x=1for0<a2< 4, x =0 fora > &. 

The treatment of this case is more difficult. It is not possible to directly 
combine the two ansatz functions for u” of Case 1 because for one index i this 
would mean p? = ah” and either pẹ, = Zar h” or pi = faih”, leading to 
limp—soo WP (u”) = oo. 

Therefore we have to introduce a transition layer of width (h”)* between 
the two phases, where s > 0 is a small constant to be chosen later. We define 


ai for0O<a< Z, 
p” (x) = 4 ar + Bh”) (x — &) for & <a < $4 (h")’, 
$a for ž + (h”)5 <æ < L. 


We set u” such that u” (x = 0) = u(x = 0) and 


n g” (ih”) for ih” < P 
Be sy"(ih”), y"(ih”) alternating for ih” > $. 
With this construction, the proof of convergence to 0 of the p?’-terms in 
W? is straightforward. Hence W? (u”) — 0 as n — co. 
For the estimation of the functional W3’(u”) we have 


25 pa e —_ £ 1 g” (ih”) - p” (G + 2)h”) |? 


ln - Piya |? _ ree 
2 yp(ih”) Qa 22. pr(ih”) 


p? 
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For I :— 27h") =e" (G+2)h”) 


TURF simple computations yield 


0 if (ih” > 4) or ((i + 1h” < 4) 

or (Ž < ih” < $ + j 
—s,(h”)1~8 if (ih” > & and (i +2)h” < £ + (h")’) 

or (ih” < Žž and £ < (i +1)h” < £4 (h”)5). 


ad 
= 
3 
~ 
v 
5 
a 
PN 
> 
+ 
wy 
= 
3 
V 
bolt 
+ 
Pst 
= 
3 
E 


l= 





and for 0 < s < 1 the convergence of W3'(u”) to 4 can be assured. 

For the estimation of W3 (u”), it is clear that outside the strip of width 
(h”)* the summands in W3 (u”) equal (h")$[x(a1 — a1)? + (1 — x) (ai — a2)? ]. 
Inside the strip, we have approximately (h”)*~! summands, where each sum- 
mand is of the form (h”)C. Thus, the part inside the strip tends to 0 for 
n — oo as long as s > 0. 





Case 3: General y € BV(Q; {0,1}) and piecewise affine, monotone and con- 
tinuous u: The construction of u” can be done by iteratively applying the 
construction given in Case 2. 


Case 4: General monotone u € H'?(2): 

Let u be a generic monotone function in H+?(Q) and let {u"} be a se- 
quence in A” such that u” — u in H1?(Q). For every n we can apply 
Case 3 to find a sequence {w7} such that w? > u” in L?(Q) as n —> oo 
and lim sup, W! (w?) < W(u”). Then we have 


lim sup lim sup W' (w?) < lim sup W (u”) = W (u), (2.7) 

noo loco n—0co 
where (2.7) holds because of the strong convergence of u” to u in H'?(Q). By 
diagonalisation, we find a sequence ù” := Wyp) such that wu” — u in L?() 
and lim sup,,_..5 W"(a") < W (u,v). 














3 An atomistic model for phase transitions of elastically 
stressed solids 


In this section we present work planned for the last year of support within 


the priority program. We start with the following Hamiltonian that has been 
proposed by S. Luckhaus, 


Hieren = / Cee 
2 


with 


bla {ahe = jt | S7¥( SS) WalAei +7) + F(4)]- B 


ier 
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Here, J is a finite index set, {x;};e, denotes the positions of the atoms, Wa 
is a periodic, non-negative potential whose zeros are on the unstrained lattice 
Aa corresponding to phase a; F plays the role of an elastic energy, and ~ is 
a cutoff function, à € Rt a scaling parameter. For a spatial point x € Q, 
the infimum in (3.1) is taken with respect to deformations A = As E€ GL(n), 
translations T = Ty € R” and phase a. 

In a suitable way, Wa can be interpreted as a mean field Hamiltonian that 
is acting on the ’one-particle density’. 

This Hamiltonian gives a reasonable description for states which have a 
lower and upper density close to that of a sheared lattice. One way to in- 
corporate this restriction on the level of the Hamiltonian itself could be to 
define 


ba, {xi }ier) z jae, (el 
O ael 


+ (5#) [5 - Wal Ay +7) -Sew-a)] dy) 
Q 


icl 


P a Uy 





)Wa(Az; +7) + F(A) 


> 





and to set 
Meihe) = f Ue, tehen +5 oles - 23): 
Q ifj 
In the last line, ¢ may have compact support or can be a hard core potential, 
the positive part [z], of z is [z]_ := z for z > 0 and [z], := 0 for z < 0. 
If À is large it makes sense to speak of the open connected sets where 


W(x, {zi}ier) < 0(A”) 


as the domains of one elastic phase. 

For x in these phase domains we conjecture that the minimal Az, Tr, az 
satisfy that (Ax, Tx) is unique modulo the affine isotropy group of the lattice, 
and a, is constant in each domain. 

A precise (and hopefully not too restrictive) estimate when this is the case 
is currently work in progress. 

If one assumes the uniqueness of Az, Ty and the constancy of a, in a 
simply connected subdomain 0, then one may construct an elastic deforma- 
tion ® such that the projection of ~'(x) is Ty and such that V(®~')a has a 
projection close to Az. 

Without assuming simple connectedness of 2 there may be an obstruction 
to the existence of &. Topologically speaking this obstruction is a homomor- 
phism 

B: m1 (92) > Ag 


from the group of affine mappings into the lattice corresponding to phase a. 
If the linear component is the identity, B coincides with the Burgers vector. 
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Since the functional — in terms of A — is automatically invariant under the 
lattice group, it does not make sense to investigate energy minimisers. It is 
well-known that energy minima do not sustain shear, [FT89]. 

So, the question is to characterise metastable states. This is completely 
open at this time. 
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Summary. The passage from microscopic systems to macroscopic ones is studied 
by starting from spatially discrete lattice systems and deriving several continuum 
limits. The lattice system is an infinite-dimensional Hamiltonian system displaying 
a variety of different dynamical behavior. Depending on the initial conditions one 
sees quite different behavior like macroscopic elastic deformations associated with 
acoustic waves or like propagation of optical pulses. We show how on a formal level 
different macroscopic systems can be derived such as the Korteweg-de Vries equation, 
the nonlinear Schrödinger equation, Whitham’s modulation equation, the three-wave 
interaction model, or the energy transport equation using the Wigner measure. We 
also address the question how the microscopic Hamiltonian and the Lagrangian 
structures transfer to similar structures on the macroscopic level. Finally we discuss 
rigorous analytical convergence results of the microscopic system to the macroscopic 
one by either weak-convergence methods or by quantitative error bounds. 


1 Introduction 


A major topic in the area of multiscale problems is the derivation of macro- 
scopic, continuum models from microscopic, discrete ones. The prototype of 
a discrete many-particle system is a periodic lattice for modeling a crystal. 
Starting from the seminal work of Fermi, Pasta, and Ulam ({[FPU55]), a lot of 
interest and work has been attracted to the study of the statical and dynam- 
ical behavior of ordered discrete systems. In the dynamical situation one is 
interested in macroscopic limits that are obtained by choosing well-prepared 
initial conditions: We choose the initial data in a specified class of functions 
and want to obtain an evolution equation within this function class, which we 
call the macroscopic limit problem. This approach is motivated by the the- 
ory of modulation equations, which evolved in the late 1960’s for problems in 
fluid mechanics (see e.g. [Mie02] for a survey on this subject). If the linearized 
model has a space-time periodic solution, one asks how initial modulations 
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of this pattern evolve in time. The modulations occur on much larger spatial 
and temporal scales; thus the modulation equation is a macroscopic equation. 

In mathematically rigorous terms this can be described by studying the 
following coarse graining diagram: 


Se 


microscopic ees macroscopic 
initial data t = 0 z? en, A’ 
[e> r>o| 
ze(r/e”) -522 A(r) 


discrete, atomistic |coarse graining| continuum 


Here Ze : [0,7./€°] — Ze denotes the solution of the microscopic model de- 
pending on the microscopic time t and A : [0, 74] — Zo is the solution of the 
macroscopic model. In the best case the diagram commutes, i.e., if the coarse 
graining SeZe(T/£7) — A(T) holds at time T = 0, then it also holds for all 
T € [0, T4]. Examples of such results will be Theorems 5.1, 5.2, 7.1 and 7.2. 

Before establishing these results, we survey methods to derive macroscopic 
models on the formal level by using a suitable multiscale ansatz and expanding 
the coefficients of equal powers of the small parameter and of the harmonics 
of the microscopic fluctuation to 0. The emphasis is to survey the theory and 
to explain the main techniques and results on simple models like the FPU 
chain or the Klein-Gordon chain, see Sect. 2.3. 

Naturally, our survey can only cover a small part of the rich subject of 
dynamics in discrete systems. We will totally omit any of the works on static 
solutions for lattices, see e.g. [FJ00, BG02b, BG02a, Sch05a, Ble05, MBLO6). 
Moreover, there is a huge body of work concerning the understanding of 
special solution classes like traveling or standing pulses with or without pe- 
riodic modulations, see [FW94, MA94, Kon96, FP99, Ioo00, IK00, FMO02, 
FP02, FM03, Jam03, FP04a, FP04b, IJ05, DHM06]. The response of oscillator 
chains to a simple initial disturbance or to Riemann initial data is studied in 
[DKV95, DK*96, DK00, BCS01, DHRO6], where in particular completely in- 
tegrable systems like the Toda lattice are of interest. Finally in the framework 
of non-equilibrium statistical mechanics (cf. for a survey e.g. [Spo91, Bol96]) 
one is interested in highly disordered systems, where only statistical averages 
satisfy nice macroscopic equations. 


2 The discrete models 


In the first subsection we write down the class of systems that can be treated 
with the methods surveyed below. This includes general polyatomic lattices in 
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any space dimension. The interactions can be general and can occur between 
several atoms, not just pair potentials, and can have arbitrary range. In the 
second subsection we treat the linearizations, which simplify a lot and can 
be treated in particular by Fourier transform methods. There, the central 
structure are the different dispersion relations, which will be used heavily in 
the subsequent analysis. Finally we present two simple model problems that 
represent most of the interesting features. These models will be addressed in 
most of the following results to illustrate the general results. 


2.1 General lattices systems 


We model a perfectly period crystal based on a d-dimensional Bravais lattice 
I embedded into Rt. This lattice is homeomorphic to the additive group Z4 
but might have a different metric structure. Each lattice point y € I” denotes 
a unit cell in the actual crystal and, hence, the vectors x, € R” and t4 
are collections of all the relevant positions and velocities, respectively, of the 
atoms inside this unit cell. By (x, x) € €2(I’)™ x €2(L)™ we denote the state of 
the system, where x = (£y)yer and x = (&,)yer. By M € R™*™ we denote 
the mass matrix for each cell, which is assumed to be symmetric and positive 
definite. The total kinetic energy in the crystal is 


K(x) = (Mx, x) © Eyer (Mty, dy). 


The potential energy V(x) is obtained by adding up all contributions acting 
on one cell via a single potential Veen : €2(I)™ — R given the forces of the 
state x on the cell at y = 0: 


V(x) = Veer Veen (Tax). 


Here T, is the translation operator with Tax = (ta++)yer. In the case of 

finite-range interaction the potential Vse only depends on finitely many com- 

ponents, e.g., Veen(x) = Vo(zo) + X o<jj<r Vy(®y— 20) for pair interactions. 
The Newtonian equations for this lattice model are given as 


Miły = —Dz, V(x) = -Ð ver V Veeu(Tax) fory ET. (2.1) 


Tazo 


Of course this system is invariant under the translations Ty, a € I, and has 
the total energy €(x,x) = K(x) + V(x) as first integral. Moreover, it is a 
canonical Hamiltonian system with momenta p = Mx, Hamiltonian function 
H, and symplectic form wean: 


H(x,p) = 3(M~'p,p)) + V(x) and 
Wean((V1, qı), (v2, q2)) = (v1, q2)) => (vo, qı). 


Clearly, the Newtonian equations (2.1) are equivalent to the Hamiltonian 
equations X = pH (x, p), p = —OxH(x, p). Moreover, they can be obtained 
as the Euler-Lagrange equation for the Lagrangian 


L(x, x) = K(x) — V(x). (2.3) 


(2.2) 
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2.2 Linear systems and dispersion relations 


Linearization leads us to linearized systems, where the potential V is a 
quadratic form. The linear equation takes the form 


Mi, = Xeer Agty+e = eek Ayata for vel, (2.4) 


where the interaction matrices satisfy the symmetry condition Ag = A! g and 


a decay condition like ||Ag|| < Ce~°!%!. The quadratic potential energy then 


reads V(x) = 3 oe, ver (Ayaka: Ly). 
An essential feature of such harmonic lattices is the presence of many 
traveling wave solutions in the form of plain waves: 


r(t) =e?" where 6 € RÉ and (A(0) — w?M)& = 0. (2.5) 


The wave vectors 0 are taken from the torus Tr, which is obtained by factoring 
R?¢ = Lin(R?) with respect to the dual lattice. The symbol matrix A(0) reads 


A(O) = Joger e An ecrx™ for € Tp. 





Hence, A(@) is Hermitian, and we always impose the basic assumption of 
stability in the form A(@) > 0 for all 6 € Tp. 

Plane-wave solutions as in (2.5) exist if w and 0 satisfy the dispersion 
relation 

0 = Disp(w, 0) $ det (um = A(0)). 
Under our stability condition, there are always m non-negative eigenvalue 
curves 
w = R0), k=1,...,m, 


which we order such that 0 < R1 < Rə < --- < Rm. The index k is called the 
band index. Two velocities will be important below, the phase velocity Cph 
and the group velocity Cgr: 





coh = a O SO and anO) = VO 


The dynamics of the linear system is completely determined by M and 
the symbol matrix A : Trp > CLV”. This is easily seen by transforming 


(2.5) into wave vector space. For this define X(0) = Fx : Tr — C™ via 
Fx de ez, then X(t) = Fx(t) : Tr > C™ satisfies the equation 


Mé?X(t, 0) = —A(0)X(t, 0) (2.6) 


if and only if x satisfies (2.5). However, the latter equation is an ODE for each 
fixed 0 € Tr. 

For studying the qualitative behavior of the solutions in the subsequent 
sections, this is not sufficient, and we need to understand the back-transform 
for large times t. Then, the smoothness properties of the dispersion relations 
will be important, see Sect. 5.3 and 6.2. 
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2.3 The chain models of FPU and KG 


To illustrate our abstract theory we will frequently refer to the simple scalar 
and one-dimensional case, viz., l = Z C R and x; € R. The models have the 
general form 


äi = -Vg (a;) + Daa (Wei en—2;) — Vi@y—aye)), FEZ (2.7) 


Here Vo is called the on-site potential that couples the atoms to a background 
field. The interaction is assumed to be pairwise and involves K neighboring 
atoms. 

The Fermi-Pasta-Ulam chain (FPU) is obtained by omitting the on-site 
potential and choosing K = 1: 


ëj = Vi(ej41—253) —Vi(@j—-a5-1), JEZ. (2.8) 


The importance of this model is its Galilean invariance, i.e., for all £,c € R 
the transformation (x, x) > (a;+€ + ct, tj+c)jez leaves (2.8) invariant. 
Another simple class is obtained by assuming again K = 1 with linear 
nearest-neighbor interaction and a nonlinear background potential. In analogy 
to the Klein-Gordon equation this model is called Klein-Gordon chain (KG): 


Ëj = @j41 — 22; +2;-1-Vo(az;), JEZ. (2.9) 
In these two models the dispersion relation has the structure 
0 = Disp(w, 0) = w? — a — 2b(1— cos 0) with a = W3’ (0) and b = V! (0), 
where a,b > 0 is equivalent to our stability condition. The solution reads 
w = (0) = (a+ 2b(1— cos De 


which is smooth for a > 0. For a = 0 we find Q(6) = Vb2|sin(@/2)|, which 
is not differentiable at 6 = 0, but the two limits +Vb of Q’ at 0 = 0 are the 
macroscopic wave speeds. 





3 Formal derivation of continuum models 


3.1 General multiscale approach 


We discuss here the derivation of macroscopic models that appear for solutions 
having a relatively small amplitude, but we refer to [DHRO6] and Sect. 3.7 for 
results on large amplitude solutions. 

The basic ansatz relies on modulations of basic plane waves e'(#*+?')@ on 
large spatial scales and suitably chosen slow time scales. We choose € > 0 
to be the small parameter that relates the microscopic and the macroscopic 
temporal and spatial scales, i.e., we set 
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tT=et and y=eyER? foryercR’. 


Of course, there are cases where different scalings in different spatial directions 
are useful, but for simplicity we restrict ourselves to this case. 

We now choose a finite set of wave vectors 6,,...,9y € Tr and associated 
band indices kı,..., ky € {1,...,m} and consider the associated plane waves 


z(t) = En (t, y)Ðn, where E,,(t,y) $£ eiat+-) 
with wn = Rk, (An) and &, = By, (An). 
This may include the case 0 = 0 and w = 0, which relates to the macroscopic 


limit of solutions without microstructure. 
The two-scale method now starts from the ansatz 


(x(t), &,(t)) = Re(A) (t), where A = (A1,..., An) and 
R.(A),(t) = D2 E7” An(e*t, €7)En(t, y)®n 





3.1 
T sae ern tor, p(ESt, ey) En Ek (3.1) 
T A pa ETEA D, p (Et, ey)En ERE; + h.o.t. 
Here the powers s,01,...,7N €E R have to be chosen appropriately. We refer to 


the variety of different models that can be obtained in this way. To obtain real- 
valued solutions one chooses A, = Ay-n and similarly for the higher order 
terms. In cases with 6,, Æ 0 the functions A, are the modulating amplitudes 
of the basic periodic plane wave. 

The aim is to derive suitable equations for A,,..., Ayn, which make this 
ansatz (3.1) consistent with the discrete model (2.1). The obtained equa- 
tions are partial differential equations combined with some algebraic relations. 
These equations are called the macroscopic equations, because they are posed 
in terms of the macroscopic variables T = e*t and y = ey. Inserting the ansatz 
(3.1) into the nonlinear system (2.1) we have to expand both sides in terms 
of the products e1 Ef with ¢ = Sea OnQn- Here we have to expand 
difference quotients 7+. — £y in terms of spatial derivative of A,. Moreover, 
the resonances between the plane waves are important to allow for nontrivial 
nonlinear interaction. They are characterized by vectors q € N such that 
ITN_,E% = 1, see Sect. 7.2 for a general theory. 

We arrive at a hierarchy of equations that can be parametrized by the 
multi-index q = (m,.-.,¢v) € NN. These equations decompose into two 
groups. If the term Eg = J7/_,E% is nonresonant, i.e., different from all 
the terms ei('+?-7) that satisfy the dispersion relation, then the equation for 
Wa(T, y) is uniquely solvable. The resonant groups associate with the terms 
Eq = IN E} that equal one of the terms ei(% (+) which without loss 
of generality is already in our list, let us say Em. Naturally, the coefficient 
W, cannot be determined uniquely, because the plane wave Em®m solves the 
linear problem. Thus, by Fredholm’s alternative we obtain a solvability con- 
dition for the terms on the left-hand side that contains only lower order terms 
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that are already determined. This gives either a PDE or an algebraic equation 
on the previously chosen functions. Moreover, the general solution contains a 
new scalar function Bg, namely Wg = Bam + we. 

We refer to [Mie02, GM04, GM06] for a more detailed description of this 
procedure. In fact, without doing any explicit calculation on the specific dis- 
crete lattice system (2.1) it is possible to describe the form of the macroscopic 
equations as follows: 


If we #0: O-Ak = Vigemy(s) ell AN, a 
If wp =0: Ar = Faem 09) CaN AM 2) 
where M;(s) pe {q | ok+s= D Ondn, 0= ae Wn ns su nn = 0 on Tr}. 
For more details see [Gia06, GMSO06] and Sect. 7.2. 

The following Sect. 3.2 to 3.7 treat a list of examples, which highlight the 
generality of the approach. 


3.2 The quasilinear wave equation 


A simple but important macroscopic model for FPU chains results by the 
following multiscale ansatz with hyperbolic scaling: 


rit) =e" X(et, ej), T=et, ye). (3.3) 


Note that here x; denotes the spatial position of atom j rather than its dis- 
placement. We insert the ansatz (3.3) into (2.8) and eliminate the relative 
displacements by the Taylor expansion xj;41 — x; ~ +e¢0,X (et, ej). Using 
ð- = €0; we can identify the macroscopic modulation equations as the non- 
linear wave equation 








Or7-X — ð Vi (3X) = 0. (3.4) 
Via r = ð} X and v = 0,X it transforms into the quasilinear first-order system 
ðr —Oyv=0, v -— ôy Vi(r)=0. (3.5) 


These equations describe the macroscopic evolution of non-oscillatory solu- 
tions of FPU. However, due to the nonlinearity V/ smooth solutions of (3.5) 
can form shocks in finite times, and in this case the quasilinear wave equation 
is not longer an appropriate macroscopic model for FPU. This problem is 
addressed in [DHR06]. 


3.3 The Korteweg-de Vries equation 


Another example for macroscopic modulation equations, see [SW00, FP99], 
relies on the KdV-ansatz 
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a,(t) = eU(e%t, e(j+ct)) (3.6) 
with scaling T = e%t, y = e(j+ct). We insert the ansatz into (2.8) and use 
Taylor expansion up to order O(e®). Comparing the leading order terms we 
find that c is given by c? = Vj’(0). Since the next order terms all cancel, 


the modulation equation is determined by the terms corresponding to £f, and 
finally we obtain 


2COnyU — yc? OyU yy — Vi" (0) ðyyyyU = 0, (3.7) 


which is a KdV equation for 0,U. 


3.4 The nonlinear Schrödinger equation 


We consider the scalar, d-dimensional lattice (2.1) (i.e., d € N and m = 1) 





by = Vocjaj<rlV3(@+e-2y)—-V3(@y—-@y-8)] -—Vo(ty), VEL, (3.8) 


and are interested in the macroscopic deformations of a modulated plane wave 
solution of the linearized system 


a(t) = €A(r, y)E(t,y) t+¢c.+ O(c?) with E(t,y) = ette) (3.9) 


(c.c.: conjugate complex) for a fixed wave vector 0 € Tr with frequency w 
satisfying the dispersion relation w? = 2?(0) > 0. 

Since the system is dispersive and nonlinear and the amplitude A is weakly 
scaled by 0 < £ < 1, we need a slow macroscopic time scale T = et comparing 
to the macroscopic space scale y = €(y—Cgrt), in order to see the evolution of 
A as time passes. This is the so called dispersive scaling. The choice of y also 
reflects that we are moving with the pulse at its microscopical group velocity 
Cer = Vo R(0). By this scaling it turns out that the evolution of A is given by 
the nonlinear Schrödinger equation 


1 
id, A = Divy(3D32(0)V,A) + p| A|? A. (nlS) 


For the justification of this equation we refer to Sect. 7.1. 


3.5 Three-wave interaction 


For the lattice (3.8) we are now interested in a macroscopic description for the 
evolution of the amplitudes A,, n = 1,2,3, of three nonlinearly interacting 
modulated plane waves with different wave numbers @, and frequencies wn, 
where w2 = 27(6,,). Thus, ansatz (3.1) takes the special form 


z(t) = € 3? _, An(t, y)En(t, y) + c-c. + O(c?) with En (t, y) = ertt) 
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Fig. 3.1. Three-wave interaction. Left: typical initial condition. Right: energy- 
distribution in space-time. 


but now using the hyperbolic scaling T = et, y = ey again. It turns out that, 
if the wave vectors @,, and frequencies wpn are in resonance, viz., 


0i + A + 03 =0 mod Tr and Wy + w2 + w3 = 0, (3.10) 


the amplitudes An, n = 1,2,3, satisfy the so called three-wave interaction 
equations 
w10, Ay = w1 Vo (01) Vy A1 + cA2As, 
w20, Ag = w2V o N(02):V y A2 + cA As, (3.11) 
w30,A3 = w3 Vo NR (03):Vy A3 + cA; Ap, 





with c = 2 )iocjaj<r Ve (0) yy _, sin(On- 8) +i Vg” (0). Each equation consists 
of a transport part via the group velocity and a nonlinear coupling to the 
two other modes. Figure 3.1 illustrates the behavior. Without the resonance 
condition (3.10) being fulfilled, nonlinear terms would not arise and the pulses 
would just pass through each other. For the justification of this equation we 
refer to Sect. 7.2. 


3.6 Coupled systems 


While the two examples above apply to a system with or without background 
potential Vo, we are now looking at systems with Galilean invariance, where 
the canonical example is FPU from (2.8). The aim is to understand the 
coupling between macroscopic deformations and microscopically oscillating 
pulses. Since in general the macroscopic wave speeds and the microscopic 
group velocity are different, we use the hyperbolic time scale T = et. Ansatz 
(3.1) reduces to 


ay(t) = e*X(r,y) + EP A(r, y)E + eF A(r, y)E + hot. 


with E = e'*+) and w = (6). Here a and 8 might be different and depend 
on the nonlinearities as well as the scaling of the initial data. We treat the 
case of the FPU chain with Vi (r) = $r? + èr? + Sr‘ with a > 0 and b,c € R. 
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As a first example we consider the case a = 0, 3 = 1 and find the system 
OX = ch? X, i8,A = icgrOyA — po(OyX) A 
with cm := 2'(0) = Va, Cgr := 2'(0) and po := 2b2(0)/a. Since the contribu- 


tions X and A scale differently, the coupling of X and A takes place only in 
one equation. We have the two conserved quantities 














H(A) = f,w2|A/?dy and E(X, X+) =f, 4X? + BX2dy. 


The second example has a = 0 and 8 = 1/2, which leads to the system 
X- = (c2 Xy + pilAl?) 2iwA, = iwegrAy — (p1Xy + 2p2|A|?) A, 
where p1 := 2b(1— cos 0) and p2 := 3c(1—cos 6)? . This system is a Lagrangian 


and Hamiltonian system in the sense to be discussed in Sect. 4. The La- 
grangian reads 


L(X, A, X,, A7) 
= fr o Im (A(24,—cgrAy)) 4 





c2 
X? — $X? — |A|? (p1Xy+p2| Al?) dy. 


1 
2 
There are two first integrals 


H(A) = fg w?| Al? dy, 
(X, A, X+) = fp wegr Im(AAy)+$X2+@ X24|Al? (91 Xy+p2|A|?) dy. 














The symplectic structure of the associated Hamiltonian system for (X, A, X+) 
is non-canonical and can easily be deduced as in Sect. 4.2. 


3.7 Whitham’s modulation equation 


In [Whi74] Whitham studies certain nonlinear PDEs and relying on the hyper- 
bolic scaling he develops a theory that is capable to describe the macroscopic 
evolution of large microscopic oscillations. Here we apply Whitham’s approach 
to three different chain models. We start with KG, cf. (2.7), and make the 
following multiscale ansatz 


a;(t) = X(et, ej, e71 O(et, ej)), (3.12) 


where X is assumed to be 27-periodic with respect to the phase variable 
ġ = 710. In this ansatz both the wave number @ and the frequency w 
depend on the macroscopic coordinates (T, y) and are defined by the modulated 
phase O via 0 = 0,0 and w = 0,0. It can be shown that to leading order 
the function X must satisfy the following nonlinear advance-delay-differential 
equation 


w? OZE = V_oV+0X — V(X) (3.13) 
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with (VieX)(o) = +2(¢ + 0) F X(¢). As usual we refer to solutions of this 
equation as traveling waves. The existence problem for solutions of (3.13) with 
small amplitudes is investigated in [IK00]. For convex potentials Vo we can 
provide existence of solutions by adapting an idea from [FV99], compare with 
the similar problem for FPU in [DHRO06]. According to (3.13), the action L 
of a traveling wave is given by 

















LO, w) = ge fo” $ (OsX)” — (V402)? — Vo(X) de. (3.14) 
To identify the macroscopic modulation equations it is convenient to use the 
Lagrangian formalism, see [Whi74] and Section 4.1, because direct expansions 
in powers of € turn out to be quite complicate. With some simple averaging the 
total action of the chain can be expressed by a functional L, which depends on 
(O, 0,©) only, and we can derive the modulation equations by the principle 
of least action. It comes out that the modulation equations are equivalent to 
the following nonlinear system of conservations laws 


0,6—-Oyw=0, O,S+dyg=0, (3.15) 


where S = 0,,L and g = OL. In particular, the system (3.15) is closed by the 
equation of state (3.14) and the Gibbs equation dL = Sdw+gd0. 

The modulation theory for FPU, see [DHM06, DHR06] and the references 
therein, is more complicate than in the KG case due to the nonlinearity of 


V,, and the Galilean invariance of (2.8). In particular, we must combine (3.3) 
and (3.12) as follows 


a,(t) =e * X (et, ej) + X(et, ej, €71 O(et, ej)), (3.16) 


where as before the profile function X is assumed to be 27-periodic with 
respect to @ = £710. This ansatz gives rise to four important macroscopic 
fields, namely the wave number 0 = 0,0, the frequency w = 0,0, the specific 
length r = 0,X, and the macroscopic velocity v = 0;X. To leading order, the 
profile function X must satisfy the traveling wave equation 


w? 3 X = V_oVi(V+0 £). (3.17) 


For convex potentials V; the existence of solutions can be proved by an convex 
optimization problem, see [DHR06], and rigorous results without convexity 
assumptions can be found in [FW94, PP00, Ioo00]. The derivation of the 
modulation equations for (3.16) again relies on Lagrangian reduction, see for 
instance [Her04, DHM06], and leads to the following nonlinear system 


ðr —Oyv=0, O-v+0yp=0, 0,0-Ojyw=0, O-5+0,g=0. (3.18) 


These equations can be interpreted as the macroscopic conservation laws of 
mass, momentum, wave number and entropy. As for KG, the constitutive re- 
lations for (3.18) result from a careful investigation of the thermodynamic 
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properties of traveling waves. More precisely, it can be shown, at least for- 
mally, that (3.17) provides an equation of state U = U(r, 6, S) as well as 
the universal Gibbs equation dE = wdS — pdr — gd0 + vdv, where U and 
E = sv? + U denote the internal and total energy, respectively. 


The third example is the discrete nonlinear Schrödinger equation 
id; + aj41 — 2a; +a;-1 + olaj/’a;=0, FEZ, (3.19) 


with complex valued a; and real parameter 9. This equation has exact solu- 
tions (traveling waves) of the form a; = Be'(+“4 with real amplitude B 
if w obeys the nonlinear dispersion relation w + cos@ — 2+ ọ B? = 0. The 
modulation theory for several variants of (3.19) was studied in [HLM94] and 
bases on the multiscale ansatz a;(t) = B(et, ej) e C4 ©9)/*, where as before 
we set 0 = 0,0 and w = 0,@. One obtains the macroscopic balance laws 


0,(A?) — 0,(2 A” sin#) =0, 0,0+0,(0A” + cos6) =0, (3.20) 


where the second evolution equation is equivalent to 0-6 — Oyw = 0. We men- 
tion that (3.20) can also be derived by means of Hamiltonian or Lagrangian 
reduction discussed in the next section, see [HLM94] for the details. 


4 Hamiltonian and Lagrangian structures 


The derivation of macroscopic equations for discrete models (or continuous 
models with microstructure) can be seen as a kind of reduction of the infinite 
dimensional system to a simpler subclass. If we choose well ordered initial con- 
ditions, we hope that the solution will stay in this order and evolve according 
to a slow evolution with macroscopic effects only. We may interprete this as a 
kind of (approximate) invariant manifold, and the macroscopic equation de- 
scribes the evolution on this manifold, the functions A1, ..., Ay defining kind 
of coordinates. For such a reduction procedure it is a natural question how 
the original Hamiltonian and Lagrangian structures, as described in Sect. 2.1, 
“reduce” to the macroscopic equation. Here we just survey the main ideas and 
some examples and refer to [GHM06] for the full details. 

Before addressing this question we first address the exact reduction of a 
Hamiltonian and Lagrangian systems to exactly invariant manifolds (cf. e.g. 
[Mie91]). First consider the Lagrangian setting for £ defined on TQ. Assume 
that we have an invariant manifold M C TQ given in the form 


M = {(q,4) = S(p, p) € TQ | (p, p) E TP}. 


Then, we may define the reduced Lagrangian L"°4 = LoS : TP — R. An easy 
calculation proves that any solution p of the reduced Lagrangian system 


0 = —2(d,L'°4(p, p)) + OpL*°4(p, p) 
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leads to a solution (q,q) = S(p,p) of the original Lagrangian system. Vice 
versa, any solution of the latter system that also lies in M solves the reduced 
Lagrangian system. 

In the Hamiltonian case the tangent bundle structure of Z = TQ is gen- 
eralized to a general symplectic structure w on the state space Z. Together 
with the Hamiltonian H the Hamiltonian system reads 


Q2(z)z=DH(z) or %=J(z)DH(z), 


where w,(v1,v2) = (Q(z)vi, v2) and J(z) = Q(z): TZ — TZ. For a 
symplectic, flow-invariant submanifold M = {z = R(y) € Z|y € Y} we 
define the reduced symplectic structure 92°% and the reduced Hamiltonian 
Hre4 via 


Q™ (y) = DR(y)*Q(Ry))DRy) and H™*"(y) = H(R(y)). 


Using the flow-invariance of M it is easy to see that any solution of the reduced 
Hamiltonian system 2°°"(y)y = DH*4(y) solves the original system and vice 
versa if starting on M. 

Our applications will of course use the ansatz Rs from (3.1) for the reduc- 
tion, which can be seen as an approximation of an invariant manifold. 


4.1 Lagrangian reduction 


The multiscale ansatz (3.1) discussed above was chosen such that it is formally 
consistent and in many cases it is possible to justify the ansatz by a rigorous 
error analysis, as surveyed in Sections 6.1 and 7. Hence, we consider the mul- 
tiscale ansatz as a parametrization of an (approximate) invariant manifold. 
Inserting the ansatz (3.1) into the Lagrangian £ defined in (2.3) we obtain a 
reduced Lagrangian in the form 


L4(e, A, 0,A) = PL(A, 83-A) + O(P), where A = (A1,..., An). 


Here L'°¢ is still an infinite sum over y € I’. However, when expanding in 
powers of £, the multiscale ansatz leads to a limit that is an integral over the 
macroscopic space variable y € R?. The infinite sum can be considered as a 
Riemann sum for the spatial integral. 

Since L*®4 is independent of 7, the solutions of the reduced Euler-Lagrange 
equation conserve the associated energy E obtain as 


(A, A,) = (3-A, Oa, L(A, A,))) F L(A, A,). 


























It is proved in [GHM06] that the Lagrangian equation for A associated with 
the lowest order term L of the reduced Lagrangian L'®*(e, -) really provides 
exactly the macroscopic equation (3.2) derived in Sect. 3.1. 

Here we illustrate this result using a simple example based on the Klein- 
Gordon chain (2.9) with the potential Vo(x) = $a?+ Eat. We consider a single 
modulated pulse in the form 
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z(t) = e? A(et,ej)E + €'/*A(et,ej)E with E = ell+t+99), (4.1) 
where w = 2(0). Inserting this ansatz into £ and using V = e!®—1 we find 
L4(e, A, A,)=Dy (§?|AE-2E/? + €%iw(AE—AB)(A,E+4,B) 
_£|AVE+AdE/? — e? (AVE+A0B) (A,E+4,B) 
sa| AE+AE? — 52|AE+AE|* + o(e?)) 
=eL(A, A,)+ O(e?) with 
L(A, A,) = fg iw(AA,—AA,) — (VAA +0 AA,) — #JAļ* dy. 





The important observation for this calculation is that the lowest order terms 
cancel, which can be seen as a manifestation of equipartition of kinetic and 
potential energy in the plane waves. Moreover, the terms involving E% with 
k Æ 0 also drop out by periodicity. This averaging is a formal procedure here, 
but we will see in the next subsection that in a two-scale setting with an extra 
phase variable it can be made exact. 


Using V—V = 2i sin 0 = 2iw(0)w'() the Euler-Lagrange equation reads 











0 = —0,(A4,L) — dy(Aq,L) + AGL = —2iwA, + iww" Ay — 3b| A|” A. (4.2) 


Of course, this is exactly the desired macroscopic modulation equation, which 
can be obtained as in Sect. 3.1. Moreover, because of invariance in 7, there is 
a first integral, namely the associated energy 











L(A, 3-A) = fp iwa! (AA, — AA,) + 32|Al* dy. 





4.2 Hamiltonian reduction 


In the Hamiltonian setting we might also try to derive the reduced Hamilto- 
nian by inserting the multiscale ansatz (3.1) into the Hamiltonian H defined 
n (2.2). We obtain 


H(e,A,0,A) = e°H(A, 8-A) + O(e2*1). 


In the example of the previous subsection we immediately find o = 0 < p= 
1 and H(A,0,A) = fp 2w?|A|? dy. Moreover, the symplectic form can be 
reduced and we obtain 


RI = 25+ O(c) with Q = 2iw. 


It is easy to see that the function H is also a first integral of the macro- 
scopic system (4.2). However, it is not the desired energy E, and the flow 
associated with the Hamiltonian system oô, A = DH(A) is the phase trans- 
lation A(0,-) + e~7#* A(t,-). The discrepancy is easily understood, because 
in H the leading terms of the kinetic and potential theory are added while 
they cancel in £. Note that H is associated with the phase symmetry of (4.2) 
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that is not present in the original discrete system. It is introduced into the 
problem via the multiscale ansatz and it manifests itself only in the limit. 

Thus, to treat the Hamiltonian limit correctly it is suitable to embed the 
discrete Hamiltonian system into a continuous one that has the corresponding 
symmetries. In this systems we can compensate for drifts in the phases via the 
phase velocity and for drifts with the group velocities by going into suitably 
moving frames. On the level of Hamiltonians this leads to a subtraction of the 
corresponding first integrals. The terms balance in exactly the right way such 
that the same cancellations occur as in the Lagrangian setting. This is the 
content of the following classical result in the theory of Hamiltonian systems 
with symmetry. 


Proposition 4.1. Let (Z,H,Q) be a Hamiltonian system, which is equi- 
variant with respect to the one-parameter symmetry group (Ta)aeR with asso- 
ciated first integral T. Then z : [0,T] > Z solves Q2 = ae ) if and only if 
Z: t Taz(t) solves QF = DH,(2), where Heu = H- 

We illustrate the idea in the pulse propagation problem treated in the 
previous subsection. The continuous Hamiltonian system is defined on the 


cylinder space-phase = = R x S! and has the configuration space L?(=). For 
functions u € L? (Z) we consider the system 


d?u = Aq, oju — au — bu? with a > 0 and 


(4.3) 
Ave,5)u(n, $) = u(n+e, +ô) zz 2u(n, p) F u(n—e, p—6). 
Introducing p = +u this is a canonical Hamiltonian system with 
H (u, p) = f= $p? +$(Vaou) + $u? + but dndg. (4.4) 


Here the important fact is that this system contains the KG chain exactly, 
because the system decouples completely into an uncountable family of KG 
chains just displaced by (7, ¢) € [0,1) x St. Moreover, (4.3) is invariant under 
translations in the spatial direction 7 as well as in the phase direction ¢. This 
leads to the two first integrals 


I®(u,p) = fe pOnudndd and IP®(u, p) = f- pOdgudnd¢. (4.5) 


The flows associated with the canonical symplectic structure and with one of 
these first integral leads to the transport along the corresponding direction 
with constant speed one. 

Using the symmetry of H°°"¢ we can go into a frame moving with the phase 
speed cph = w/6. According to Proposition 4.1 the corresponding Hamiltonian 
is HP® (u, p) = H°™(u,p) — wI (u, p). Into this Hamiltonian we insert the 
suitably adjusted multiscale ansatz (4.1), namely 


u(t,n, p) =e? A(et, en)Epn + €/? Alet, en)Ep 


h; 
plt, n, 9) =£ iw(A(et, en)Epn — Alet, en)Epn) 
+€9/? (0, A(et, en)Eph + 0,A(et, eB, a), 
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where Eph = e'(+%©) does no longer depend on time. Through the subtraction 
of the properly chosen multiple of the corresponding first integral we exactly 
obtain the cancellation of the leading terms. Moreover, integration over ¢ € S! 
makes all terms ES, with k # 0 exactly 0. Hence, the resulting reduced 
Hamiltonian has the expansion 





H?°4(A, 0, A) = cE(A) + O(e?) 





























with E from above. A simple calculation shows that 90,4 = DE(A) is 
exactly the macroscopic equation (4.2). 








4.3 Derivation of KdV from the FPU chain 


Here we apply both the Lagrangian and Hamiltonian reduction from above 
to the FPU chain with KdV-multiscale ansatz, see (3.6). For simplicity we 
restrict to the infinite chain with V\(0) = V/(0) = 0, and we always assume 
that all arising integrals do exist. 

Following the idea in [BP06] we embed the discrete system into a contin- 
uous one. For this example we choose the continuous configuration space Q 
to be L?(R) and identify each discrete configuration (23) jez with an piece- 
wise linear function w = w(n) € L?(R) defined by x; = w(j). Since (3.6), 
ie. w(t, n) = EU (et, e(n+ct)), describes slow macroscopic modulations with- 
out fast oscillations, there is no need for introding phase variables. The La- 
grangian £ of the continuous system is given by L(w, w) = K(w) — V(w), 
with 

V(w) = fe Vi(V+w)dn, K(w) = few? dn (4.6) 
with (Viw)(7) = w(n+1) — w(n). The continuous system is invariant under 
the group of translations, and this gives rise to a further conserved quantity Z. 
Exploiting Noether’s theorem we find the first integral Z(w, ù) = fg ù O,wdn, 
which has no counterpart in the discrete microscopic FPU chain. 

Inserting the ansatz (3.6) into the energies and using fg 0,U0,,Udy = 0, 


AyU OyyyU dy = — f, (OyyU)?dy, and c? = V? (0) we find 
RYVY Yyy R yy 
K(w) = e? + H(U) + € I(U, 8U) + O(e"), 
+ E(U) + O(e"), 
T(w, ù) = e? ct H(U) + e ct I(U, 0,U) + Of"), 














S 
~~ wm’ 
ou 





where 


H(U) = c? fe (QyU)* dy, I(U, 0,U) = c fg 3-U 3 Udy, 
(U) = — dc? fr (Oyy) dy + 4 Vi" (0) fe (QU)? dy. 























Consequently, with L = I — E we find 
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w, w) =e L(U, 0,U) + O(e"), 
H(w, w) = e° H(U) +e 1(U, 0,U) + 5 E(U) + O(e’), 
H(w, w) — eZ(w, w) = E EU) + O(e"), 


























and it follows that the reduced Lagrangian equation equals (3.7). 

In the next step we reduce the Hamiltonian structure. For the microscopic 
continuous system the canonical momentum is given by p = ù with Hamilto- 
nian H(w, p) = K(p) + V(w). For (w,p) the multiscale ansatz (3.6) means 


(w, p) = Re(U)(n) = (eU (en), e4 3-U (en) + e° cOyU(en)), 


where the last term is due to the frame moving with speed c. Reduction of the 
canonical symplectic form Q with (Q (w, p), (w, p)) = J» wp—wpdn leads to 


(Q R.(U), Re(U)) = €2(Q"°4U, Ü) + O(e*) with 
(2U, Ù) = fe (UÜ -Ü 8U) dn = —2 c fg OyU Ü dn. 


From this we conclude 9™°? = —2 c ôy. Note that N°? is defined on L?(R), 
whereas 9 lives on L?(R)xL?(R). This dimension reduction is natural, be- 
cause the multiscale ansatz (3.6) yields a coupling of w and p in leading 
order. Finally it follows immediately that the reduced Hamiltonian equation 
NIU, = DE(U) is again equivalent to (3.7). 














4.4 Derivation of nlS from the KG chain 


We consider the KG chain (2.9) with Vo(a) = $x? + 42%. The sum of the 
kinetic and potential energy gives the Hamiltonian 


A(x, %) = Dyex (89+ 9 (@j41 25)? + $254 725) - 


Since we are interested in modulated pulses, we proceed as in Section 4.2 and 
embed the discrete chain on Z into the cylinder 5 = RxS! leading to the 
continuous Hamiltonian system (4.3) with Hamiltonian H°°"* in (4.4). 

Again we have the two symmetries of spatial translations T® and phase 
translations TP! leading to the two first integrals [SP and JP given in (4.5). 
However, we proceed differently, because we are interested in a dispersive 
ansatz u(t,7) = eA(e7t, e(ntct))E + c.c. + h.o.t., where c = cgr, cf. (3.9). 
Thus, we apply Proposition 4.1 using the symmetry transformation 








(č, p) = TRTE olup) H =H- cI? — (w—c0) P. 


The associated canonical Hamiltonian system R (&, pP) = DH(t,p) on 
L(Z)? is still fully equivalent to a family of uncoupled KG chains. 
Inserting the scaling exposes the macroscopic behavior. For this define 


(u(n, $), p(n, ¢)) = (EU (en, 6—-On), €P(en, 6—On)), 
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which keeps the canonical structure, if we move a factor the £, which arises 
from the transformation rule dy = £ dn, into a the time parametrization 
T = €*t. We obtain the new Hamiltonian 


He(U, P) = Jes gle ([P-wUy—ecty]” + (Vie) 
+aU? — [wPUs-+ecPUy]” )+4U4dydd, 


where V(c,9)U(y, 6) = U(yte, +0) — U (y, ¢). Now we see that the suitably 
transformed version of the modulational ansatz (3.9), viz., 


(U(y,¢), Ply, @)) = Re(A)(y, $) = (Re A(y)e'®, w Re A(y)e"®) + Of), 
leads to the expansion 
He(Re(A)) = Hnis(A) + O(e) with Hnig(A) = fg ww” |A|? + 22|Al4 dy 


and the reduced symplectic structure Q™°? = 2iw. Thus, we recover the one- 
dimensional version of nlS given in Sect. 3.4. 


5 Weak convergence methods 


For static problems there is a rich literature concerning the I’-convergence 
of potential energy functionals of discrete models to continuum models (cf. 
[FJ00, FT02, BG02a, BG02b, MBLO6]). Here we want to summarize some 
first results for dynamic problems that rely on weak convergence. 


5.1 An abstract weak convergence result 


In [Mie06a] it was shown that linear elastodynamics can be derived from a 
general linear lattice model as described in Sect. 2. However, this result used 
exact periodicity and linearity in an essential way. The abstract approach pre- 
sented here will be discussed in [Mie06b] in full details. Its main advantage 
lies in the flexibility, which allows for applications in nonlinear and macro- 
scopically heterogeneous settings. 

We consider a family of Hamiltonian systems parametrized by € € [0, 1], 


Q.(2)z = DH.(z), (5.1) 


and we are interested in the limit behavior for € — 0. Again, € measures the 
ratio between the microscopic and the macroscopic spatial scales, viz., y = €7. 

We consider the situation that all He are defined on one reflexive Banach 
space Z, but may take the value +00 outside the subspace Ze. It is a question 
of general interest to characterize the further conditions on the convergence 
of He to Ho and of R. to Mo such that suitable limits z of solutions ze of 
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(5.1) are solutions of the limit problem (5.1) for e = 0. A first guess would be 
that Ho is the I’-limit of He, i.e. 
(G1) zz = Hol(z)< lim inf He (ze), 
E> 
(G2) VYz€Z a(Ze)ee(0,1): Ze — z and Ho(z) = lim He(Ze). 
B= 





However, we will see below that it cannot be expected in general. 

We assume that the subspaces Z- C Z are closed and that He € C!(Z-,R) 
for £ € [0,1]. Moreover, there exist mappings Ge € Lin(Zo, Ze) such that we 
have 

Ze Ə ze >z € Zo => GŽDH:(ze)— DHo(z) in Zġ. (5.2) 


Finally we assume that the symplectic operators s are independent of z € Z 
and that there exists a larger Banach space W such that Z embeds continu- 
ously and densely into W such that 2, : W — Z* has an inverse operator for 
all € € [0,1] with the norm bounded independently of £. For the convergence 
we ask the condition 


Ze D Ze —>z E€ Zo = GENe%e — Noz in Z*. (5.3) 


Now we use the fact that solutions ze of (5.1) also solve the weak equation 


Jo (DHe(ze(t)), pelt)) + (Reze(t), Pe(t))dt — (Reze pellg =0 (5.4) 


for all pe € C1([0, T], Ze). Choosing y-(t) = G.-y(t) for some y € C1([0, T], Zo) 
and using suitable a priori bounds on z in C°((0, T], Z) 9 C1((0, T], W) it is 
possible to extract a weakly convergent subsequence with z-(t) — z(t) for 
some z € C°([0, T], Zw) AL°([0, T], W). By the assumptions (5.2) and (5.3) 
we pass to the limit in (5.4) and obtain 


Jo (DHo(2), p) + (Roz, p) dt — (Noz, ola =0. 


Under suitable assumptions it then follows that z solves (5.1) for € = 0. 


5.2 Elastodynamics 


The program described in the previous subsection can be applied to poly- 
atomic Klein-Gordon chains, which we also allow to have large-scale variations 
in the stiffness and masses. The KG chains under consideration are assumed 
to have a periodicity of N on the microscopic level, and all quantities may 
change also on the macroscopic scale y = ej. Fork € Zy = {jmodN|j EZ} 
we have given functions mz, ak, bp, Ck E€ L° (IR), which are all bounded from 
below by a positive constant. The KG chain is then given by the canonical 
Hamiltonian system on (? x (? 





i p3 j (ed) 
HÄ (x, p) = 2 (acts +4 LEH (2541-23)? 
J 


(5.5) 
7b] (ej) 2 e*cr5] (ej) 4 
ae + Sey) yt), 
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where [j] = j mod N. To derive a suitable continuum model we embed £? x ? 
into Z = Zo = H! (R) x L?(R) via 


Ze ={ (u,v) € Z | uljejej+e] affine, v|(ej—e/2, ej+e/2) constant }and 


(u,v) = E(x, p) with (u(ej), v(ej)) = (xj, pj) for all j € Z. (5:6) 


The associated Hamiltonian He coincides with H2'S* up to a factor e, which 
relates to the time rescaling, namely He(u, v) = 


Sa ares + Aegu (y)? dy + 2 e(d ulej)? + LID u(e7)4), 
JE 


where M(y,z) = mpy(y) for z € (k—1/2, k+1/2), A(y,z) = ajņ(y) for z € 
(k, k+1) for k € Z, with similar formulas for B and C. 
The important step in the analysis is the construction of the operator 


Ge: Zo > Ze. We define (uz, ve) = Ge(u, v) via ve(y) = Muye oy) and 


Sp Aly, y/u (yT (y)+ue(y)ti(y) dy = fp A (y)u' w +uŭdy 


for all ù with (ù, 0) € Z., see (5.6). Here A* is the averaged stiffness and M* 
the averaged masses 


=(4 JA z) baz) 1 and M*(y )=4 JFM y, 


It is then possible to prove the abstract conditions 5.2 and 5.3, which leads 
to the following results, cf. [Mie06b]. 





Theorem 5.1. Let E; : ( x L — Z = H! (R) x L?(R) be the embedding in 
(5.6). Let (x€, p£) : [0, T/e] — £ x8 be solutions of the canonical Hamiltonian 
system associated with HËS® in (5.5). If for T =0 we have 


(uem) elata) > aoan) 4 
then this convergence holds for all r € [0,T], where (u,v) : [0,T] > Z is a 


solution of the macroscopic wave equation arising from the canonical Hamil- 
tonian system with 


eee ea Wy? 4 SW yA ay, 


where B*(y) = =i" B z)dz and C*(y) = Sod Lfe 


Y, 


It should be noted that Ho is not the T-limit of He when using canon- 
ical variables. However, if we use the Lagrangian coordinates (uf, ù) = 
(u£, M(-,-/£)1pF), then it is the T-limit. 
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5.3 Energy transport via Wigner-Husimi measures 


Waves in dispersive media travel with a speed that depends on their wave 
length. We now discuss this for the general linear model introduced in 
Sect. 2.2. Wave propagation is driven by the group velocity Cer = V2;(9), 
which depends on the wave vector 6 € Tp and the band number j € 
{1,...,m}. Thus, at each macroscopic point y € R? we need to know how 
much energy is located in which band and in which wave-vector regime. 

The relevant mathematical tool is the Wigner measure or the Husimi mea- 
sure, which was used in [Gér91, LP93, MMP94, GMMP97, TP04] to study 
transport of oscillations (relating to energy, density, or other physical quan- 
tities). The case of discrete lattices is analyzed in detail in [Mac04, Mie06a]. 
For this we rewrite (2.6) into diagonal and rescaled form 

2 u(7,6) = Ble, 0)U* (7, 8) with Bie, 0) = ~diag(® (0), very Qm(O)). (5.7) 
The Wigner transform W*[u*] of uë = F~1U® is now defined as a matrix- 
valued distribution on R x Tp. For the diagonal entries it is possible to pass 
to the limit € — 0 and one finds the Wigner measure py = lim (W*[u3]);;. 


























More precisely, we have the following result, see [Mie06a]. 


Theorem 5.2. Let uf : [0, T] — L?(Zp,C™) be a family of solutions for (5.7) 
with ||ue(0)||L2 < C. Let j € {1,...,m} and Sj C Tr be given such that 
Nj € Cl(Tr\S;). If for T =0 we have 


lim (W° [u°] (T))jj = uY (T) in D(R¢xTr) and us’ (0, Rx 55) =0, 


then this convergence holds for all r € [0, T], where uY : [0, T] + M(R¢xTr) 
is a solution of the energy-transport equation 


Op = VQ;(9) + yp on [0, T]xR?xTr. 
Using this result it is possible to obtain the energy distribution by inte- 
gration over 0, namely 
j=1 0€Tp 


The above theorem is restricted to the case that no mass concentrates on the 
singular set S;, where the dispersion relation is not smooth and, hence, the 
group velocity is not defined. However, using the Husimi measure as developed 
in [Mie06a] it is possible to treat this case also in some cases. 


6 Quantitative estimates via Gronwall estimates 


Another technique for the justification of continuum models uses quantita- 
tive estimates to control the error between the macroscopic equation and the 
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Fig. 5.1. Left: energy distribution at t = 200 for the linear chain %; = £j+1 — 2£j + 
zj—ı with initial data x;(0) = 6; and t;(0) = 0. Right: displacement for the square 
lattice Z? with simple nearest-neighbor interaction at time t = 120, cf. [Pat06]. 


microscopic equation. We present the abstract idea in Sect. 6.1 and apply in 
Sect. 7. This method can also be used to prove dispersive stability results as 
discussed Sect. 6.2. 

We work totally in the original microscopic lattice model 


ż = Lz+N(z), (6.1) 


where Z is the Banach space for the state z(t), and L: Z — Z is the linear 
part, which is assumed to generate a bounded semigroup (e*"); 0, i.e. 


ACh >0Yt>0YzEZ:  |le*z|| < CLllz|l. (6.2) 





We also rely on our standard assumption that the solution z = 0 is energeti- 
cally stable, as the Hamiltonian energy is conserved. Here it means 


4Ce >O0V sln. z of (6.1) Vt>0: |l] < Ce|lz(0)||. (6.3) 





The nonlinearity M : Z —> Z is assumed to be locally Lipschitz. However, 
the essential features have to be addressed by using additional Banach spaces 
Y and W such Y C Z C W with continuous embeddings and 


(i) Yz € Z: |zllw <llzl| i) V2 eY: |A] < lzy- (6.4) 
In applications to lattices we have in mind 
Y=4,(7,R*)?, Z=&(F,R*)?, W=2(F,R*). (6.5) 


In Sect. 6.1 the importance is that M satisfies 














C>04v>0V 21, 22 € Z with ||z1||w, ||zollw <1: 
INC) = N(22)I] < Cx (lleallw + llzallw) z122- 


In Sect. 6.2 the importance of Y is the dispersive decay estimate 
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_ K C 
Jx € (0,1)3Cw >0YzEYYt>0:  |letzlw < gll- (6.7) 

of the linear semigroup. For the nonlinearity we then use 
JCw >0da,v>0Vz2EZ: INC lly < Cwlizlýlzle (6.8) 








With Y, Z and W as in (6.5) a standard nonlinearity V((x7)yer) = (n(xy))yer 
with |n(€1) — n(€2)| < C(\€1| + |£2|)f £1 — £2| will satisfy (6.6) with v = 8 and 
(6.8) with v = 8—1 anda = 2. 


6.1 Error control for approximate solutions 


The basic idea is to construct an approximate solution Zapp, which in fact 
will be given in the form Zapp = R-(A), and to derive an estimate for the 
associated error. For any z € C1({0,T], Z) we define the residual via 


Res(z)(t) = 4(t) — Lz(t) — N (z(t). (6.9) 


The following result shows that the smallness of the residual together with 
the stability condition (6.2) implies that the error between Zapp and an exact 
solution is small. 


Theorem 6.1. Assume that the conditions (6.2), (6.41) and (6.6) hold. More- 
over, let Cr, Ca, Tx, 7, a, Q > 0 be given as well as a family (Zipp)ee(0,1) Of 


approximate solutions 25, € C1([0,t./e7], Z) satisfying 


IlZapp(4) lw < Cae” and ||Res(z3pp)(t)|l < Cre? (6.10) 
for allt € [0,7./e°]. Moreover, assume 
o>at+o and vata. (6.11) 


Then, for each d > O there exist co € (0,1) and D > 0 such that for all 
E € (0, €0] any exact solution z of (6.1) with ||z(0)—Z2pp(0)|| < de® satisfies 


l(t) — zpp(t)|| < De®? for t € [0, 7./e"]. (6.12) 


In (6.11) the case va > o is not really interesting, as in this regime the 
nonlinearity is not really active. In the first inequality ọ may be as big as we 
like, what improves the order of approximation in (6.12) but does not allow 
us to extend the length of the time interval, i.e, to make o bigger, because it 
is restricted by the second inequality. 


Proof. For the construction of € and D we define C1 = Cr(d + CRT+) and 


C2 = CrCy(3C a)” and let D = 2Cie©2™ and ceo = min{1,(C4/D)°*}, where 
6=1/(e-—a-—a)>0. 
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We write the exact solution z of (6.1) in the form z(t) = 2,,(t) + eb R(t) 
with 8 = o — ø. Clearly || R(0)|| < d and we have to show ||R(t)|| < D for all 
t € [0,7,./e7]. Inserting this ansatz into (6.1) and applying the variation-of 
constants formula we find 


R(t) = et R(0)+ j ot (N (zEpp(s)-+68R(8))—N (28 pp (8)) -Res(z£pp)(8)) ds. 


Defining r(t) = || R(t)|| and using the available estimates give 
r(t) < Crd + fo Cr (Cw[Cae® + Cae® + ef D]’r(s) + Cree?) ds, 


where we assumed r(s) < D on [0,tp] and t < tp. Note that d < D and r is 
continuous, which implies tp > 0. We will show that tp = 7,/e°. 

Assuming € € (0, €o] we arrive at r(t) < Crd+C,Cre?t+ Coe” ite r(s)ds. 
Because of €t < 7, we find r(t) < Ci +e7C ie r(s)ds and Gronwall’s lemma 
gives r(t) < Cye@2""* < Cye2™ = D/2 for all t € [0, tp]. However, this shows 
that r(t) cannot reach D. As a consequence we may choose tp = 7,./e7 and 
we are done. E 


6.2 Dispersive stability 


Here we present conditions which guarantee that the dispersive decay estimate 
(6.7) for the linear semigroup can be transfered to the full nonlinear problem. 
We follow ideas from [Sch96, MSU01] and refer to [Pat06] for more satisfactory 
results. 


Theorem 6.2. Assume that (6.3), (6.7), and (6.8) hold with vk > 1. Then, 
there exist C,n > 0 such that all solutions z of (6.1) with ||z(0)||y < n satisfy 


llz(t)|\lw < —|z(0)|ly for all t> 0. (6.13) 


C 
(+t) 


Proof. We follow the ideas in [MSU01] Lemma 3 and adapt it to the more 
general case. We rely on 0 < k < 1 < vk, which yield the estimate 


c 


t d vk ; oe (228 Phi 
Jo 1+s)*Y Tes z Š (1+t)* with CuK = (Ba + 2) (6.14) 











This is easily obtained by estimating i ? and Ne separately. Using the 
variation-of-constants formula together with the available estimates we find 


lllw < Beeler + fi ee Colle(s) léyllz(s) leas. 
With r(t) = max{ (1+ s)"||z(s)||w |s € [0,t] } and ô = ||z(0)||y we obtain 


k t CwCnCOCRr(t)” ô” 
AHIO < Cw5 + fy PEE ds. 
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Employing (6.13) and using that r is nondecreasing we find 
r(t) < Cwd+C,6°r(t)” for all t > 0, where C, = cy,.CwCnCf. 


We now choose n such that C.7°(3Cwn)” < Cwn and claim that r(t) remains 
less than 3Cw6 if ||z(0)||y = 6 < n, i.e., the desired assertion holds with 
C = 3Cw. Let tw = sup{t > 0|Vs € [0,t] : r(s) < 3Cwô}, then for 
t € [0,tw] and 0 < 6 < 7 we have 


r(t) = Cwd + C,6%(3Cwd)” < 2Cwd < 3Cwô. 


Since r is also continuous, we conclude tw = oo. E 


The typical application of the above result involves the spaces Y = 4 
and Z = b2 and W = læ. Hence, for a nonlinearity with N(x) = (n(z;))jer 
and |n(x;)| < Cylx;|? we have (6.8) with a = 2 and v = 8—2. Moreover, 
the theory in [Pat06] provides explicity values of x, which can be determined 
directly for the properties of the dispersion relations w = Q,,(@) discussed in 
Sect. 2.2. For this note that e*t can be written as a discrete convolution 


PGES (ene Onan) aes 


where the Green’s functions G,(t) € R?"*?™ satisfy Go(0) = id and G,(0) = 
0 for y # 0. Each component of each G} (t) can be calculated via oscillatory 
integrals of the type 


e 


Sexe el 2x (9)t+9-7) 9(9) dO 


with given smooth functions g. Uniform decay properties in y € I’ for such 
integrals strongly depend on the non-degeneracy of D? Ny (0). Integrating over 
balls in Zr, where det D?.Q;,(0) is bounded away from 0, we easily obtain a 
decay like t~¢/?. However, due to periodicity, degeneracies must occur, and 
the uniform decay is always worse. 

For instance, the one-dimensional FPU and the KG chains from Section 
2.3 lead to k = 1/3, because 2 : St — R has turning points and the third 
derivative is nonzero in these points. As a consequence the above method leads 
to the following very preliminary dispersive decay result. 


Proposition 6.3. Consider the KG chain (2.9) with Vo of the form Vj (£) = 
ax + O(|a|°) for |x| — 0 with a > 0 and B > 5. Then, there exists 5 > 0 and 
C > 0 such that for each initial condition (x(0),x(0)) we have 


lI(<(0), 0) le <6 = MEHO) < TEL for all t > 0. 


This result is still very weak in terms of the restriction on 3, and we refer 
to [Pat06] for improved results . See also [Zua05, IZ05] for related dispersive 
decay results in discrete approximations of PDEs. 
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7 Justification of modulation equations 


In this section we provide rigorous justification results for two examples. In 
contrast to Sect. 5 we will use the quantitative estimates provided in Sect. 6.1. 
The ideas are based on the justification theory developed for general modula- 
tion equations, see [KSM92, Sch94, Sch98] and the surveys [MSU01, Mie02]. 
In particular, we mention the papers [Sch95, Sch05b], which contain examples, 
where the modulation equations, derived formally as in Sect. 3, fail to pre- 
dict the dynamics of the microscopic system correctly. Thus, the justification 
results are needed to validate the formally obtained macroscopic equations. 

To explain the main ideas and still stay sufficiently simple we consider for 
both subsequent examples the d-dimensional, scalar model (3.8). The main 
observation about the multiscale ansatz oe = e°A(ey)E + cc. is that it 
satisfies the estimates 


Neg yerle < Coe? "|| Alle and |[(2*)yerlle.. < Cse” Allure, 


for any s > d/2. Thus, our solutions z = (x,x) will be small only in 
W = ¢..(L)? but may be large in Z = f2(I’)?. However, for using the ab- 
stract approach provided in Theorem 6.1 we need to make the residual of the 
approximate solution Zapp = R-(A) small in Z. This means that the order of 
approximation of the formal ansatz Re in (3.1) has to be taken sufficiently 
high depending on the dimension d. 


7.1 Nonlinear Schrödinger equation 


We want to justify the nonlinear Schrodinger equation 
1 
id, A = divy($D62(6)VyA) + p|A|?A (nlS) 


as a macroscopic modulation equation for the microscopic lattice system (3.8), 
for the formal derivation see Sect. 3.4. We use the dispersive scaling T = €7t 
and y = e(t—Cgrt) for the basic periodic pattern E = ei(¢*+), where w = 
(0) and cgr = Q’ (0). To derive an evolution equation for the macroscopic 
modulation amplitude A : [0,00)xR¢ — C we have to use the improved ansatz 


2) = RE (Ayo = Epai es per, 


where all the coefficient functions Akn can be calculated formally if the non- 
resonance condition of order K holds, namely 


n? (0)? A Q(nO) for n = 0,2,3,..., K. (7.1) 


Of course, we have A = Aj,1, where A satisfies (nlS). The other coeffi- 
cient functions satisfy Ak, -n = Ag» and are either algebraic expressions 


of functions 070) Af, with r+2|s|+pq = k, p < k—1 or (for n = 1, where 
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the non-resonance condition fails) they satisfy some linear inhomogeneous 
Schrodinger-type equations. 

Since all coefficients of the terms e*E” with k = 1, ..., K are equated to 0, 
the residual of the ansatz Zapp = (RE (A), 4 RK(A)) : [0,74/e?] > Z = 4r}? 
satisfies 

|[Res(zapp)(t)lle. < Ce%** Allin and ||Res(zapp)(t)llea < Ce8T- 4? || Alte 


for any suitable s > K+2+d/2. Thus, we have all the ingredients to apply 
Theorem 6.1. However, we note that the dispersive time scale T = £°t needs 
g = 2, while the amplitude ||zapp(4)||e.. ~ €% with a = 1. Now condition 
(6.11) only holds for v > 2. Thus, the nonlinearity M needs to be cubic (cf. 
(6.6)). The following result realizes this condition by assuming V;"(0) = 0, 
see [GM04] for the case d = 1. 


Theorem 7.1. Let K € N with K > 24+d/2 and assume that the scalar d- 
dimensional lattice model (3.8) has potentials Vg € C**?(R) with Vg(0) = 
V3(0) = V3" (0) = 0. Choose a wave vector 6 € Tr satisfying the non-resonance 
conditions (7.1). Let A € C((0, Ts], H* +3(R%, C)) NC ((0, 74], H* +1 (R2, C)) be 
an arbitrary solution of (nlS). Then, for each d > 0 there exist £o € (0,1) and 
D > 0 such that for all € € (0,¢0] any exact solution x of (3.8) with 


\|(x(0), *(0)) — (RE-?(.A)(0), RX—2(A)(0))|le, < de® 1-4/2 
satisfies, for all t € [0, T./e7], 
Il (x(t), X(t) — (REHAN), RETHA Hle < De., 


The condition V4” (0) = 0 allows us to apply the simple abstract result of 
Sect. 6.1. However, this condition is not necessary. In the case of nonlinearities 
that also have a quadratic part it is still possible to derive a similar result if 
we impose more restrictive non-resonance conditions. To treat that case one 
uses ideas from the theory of normal forms to transform the system via a 
near identity transform into a system that has the same linear part but no 
quadratic part in the nonlinearity. We refer to [Sch98, GM06] for positive 
results and mention also [Sch05b] for an example, where the result fails due 
to fact that the more restrictive non-resonance condition is violated. 


7.2 Interaction of several modulated pulses 


We report on results in [Gia06] and consider the scalar d-dimensional model 
(3.8) for which we want to show how the three-wave interaction equations 
(3.11) can be justified in terms of explicit error estimates. Given are three 
wave vectors 0, € Tr and associated frequencies wn with w2 = R? (0n), which 
are in resonance, namely 


0i +62 +683 =0 infr, wi + w2 + w3 =Q. (7.2) 
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Following [Gia06, GMS06] we use the following type of non-resonance condi- 
tion for other combinations of these wave vectors. We set 6_, := —0@, and 
W-n := —wn and say that the mode system { (0n, wn) : n = 1, 2,3} is closed of 
order K, if for all k € {1,..., K} and all n1, ... np € Ñ = {-3, —2, —1,1, 2,3} 
the following holds: 





hia ae ka Sb Ins E N: On, = OF On 
(Xi wn.) = 27 On) ar and wn, =F Wm. 





(7.3) 


Here we use the hyperbolic scaling 7 = et and y = ey and, as explained at 
the beginning of Sect. 7, we need the improved multiscale ansatz 


TETP 


expect to travel with their group velocities and to have interactions with the 
other wave packets. 

As explained in Sect. 3.1 it is possible to determine the coefficient functions 
aoa n, in such a way that the approximate solution Zapp = (RE (A)(t), RE (A)(t)) 
and the residual Res(Zapp) satisfy 


|Zapp()|le. < Ce* with a=1 and ||Res(Zapp)(t) lle. < Cen ti=d/2 


if the triple A = (A1, A2, A3) : [0, T4] — L? (R2, C)? is a sufficiently smooth 
solution of the three-wave interaction equation (3.11). Since rT = 7t with 
o = 1, we may apply Theorem 6.1 with v = 1, which means that nonlinearities 
with quadratic parts are allowed. 

The precise statement from [Gia06] reads as follows. 


Theorem 7.2. Let K € N with K > 1 + d/2 and assume that the scalar, 
d-dimensional lattice model (3.8) has potentials Vg € C*+?(R) with V3(0) = 
V3(0) = 0 for |6| < R. Assume that the mode system {(0n, wn) : n = 1,2, 3} 
satisfies the resonance condition (7.2) and is closed of order K (cf. (7.3)). Let 
A € C((0, 74], H* +? (R4; C)) NC**+((0, 7], H! (R4; C)) be an arbitrary solution 
of (3.11). Then, for each d > 0 there exist £o € (0,1) and D > 0 such that 
for alle € (0,€0] any exact solution x of (3.8) with 


I((0), X(0)) = (RE(A)(0), RE*(A)(O)) leg < de= 
satisfies, for all t € [0,7./e], 
IEO, (8) — (REA), REA) leg < De? 


The whole theory can be generalized in several aspects. First we may con- 
sider mode systems with N different wave vectors, where N > 4. Then, we ob- 
tain a system of N equations for Aj, ..., Ay, where only those quadratic terms 
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An Ang occur in the equation for 0;An, if the three modes (On,,Wn,)1=1,2,3 
satisfy the resonance condition (7.2). Other triple interactions do not mat- 
ter on this time scale either because the frequencies or the wave vectors do 
not resonate. Quadruple or higher interactions are too small in amplitude to 
influence the macroscopic behavior (cf. [Gia06]). 

Second it is possible to do the very same analysis for systems rather than 
for a scalar problem. Of course, then we have to pay attention to the different 
frequency bands. We also refer to [GMS06], where multipulse interactions 
are treated for nonlinear Schrödinger equations with periodic potentials, see 
[(CMS04, Spa06]. 

Similar phenomena arise in such different subjects as phonon collisions (cf. 
[Spo05]) and in surface water waves (cf. [SW03]). 
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Summary. We present an overview of recent results concerning wave trains, soli- 
tons and their modulation in FPU chains. We take a thermodynamic perspective 
and use hyperbolic scaling of particle index and time in order to pass to a macro- 
scopic continuum limit. While strong convergence yields the well-known p-system of 
mass and momentum conservation, we generally obtain a weak form of it in terms 
of Young measures. The modulation approach accounts for microscopic oscillations, 
which we interpret as temperature, causing convergence only in a weak, average 
sense. We present the arising Whitham modulation equations in a thermodynamic 
form, as well as analytic and numerical tools for the resolution of the modulated 
wave trains. As a prototype for the occurrence of temperature from oscillation-free 
initial data, we discuss various Riemann problems, and the arising dispersive shock 
fans, which replace Lax-shocks. We predict scaling and jump conditions assuming a 
generic soliton at the shock front. 


1 Introduction 


We consider chains of N identical particles as plotted in Fig. 1.1, nearest 
neighbor coupled in a nonlinear potential @ : R — R by Newton’s equations 


Le = P'(£a+1 = La) = P' (£a = Li), (1.1) 


where © = is the time derivative, x(t) the atomic position, and a = 
1,..., N the particle index. Since the work of Fermi, Pasta and Ulam [FPU55] 
one usually refers to (1.1) as FPU chains. 

We mainly consider general, convex potentials @. While our focus lies 
on nonlinear @’, the harmonic potential with linear forces is an instructive, 
completely integrable example. A nonlinear example, but still completely in- 


tegrable, is the famous Toda chain, see [Tod70, DK*74, Hén74] with potential 


(r) = exp (1 — r)— (1 — r). (1.2) 
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Ta-1 Ta Ta+1 Ta+2 


—@— vwe — vw — e — vwe- 


—_—_—_—___._._» 
Ta 


Fig. 1.1. The atomic chain with nearest neighbour interaction. 


For our purposes it is convenient to use the atomic distances rg = Za+1 — La 
and velocities va = ta as the basic variables, changing (1.1) to the system 


Pa =Vati-Va; va = P'(ra)— P (ra—1). (1.3) 


Rather than investigating solutions of (1.3) for finite N, we focus on the 
thermodynamic limit € = 1/N — 0 in the hyperbolic scaling of the microscopic 
coordinates t and a, which is defined by the macroscopic time t = et and 
particle index @ = ea. It is natural to scale the atomic positions in the same 
way, i.e. T = ex, which leaves atomic distances and velocities scale invariant. 
For a survey on other reasonable scaling we refer to Sect. 2 and [GHM06a]. 


Our main goal is to derive a micro-macro transition for the atomic chain, 
i.e. we aim to replace the high dimensional ODE (1.1) by a few macroscopic 
PDEs. The derivation of such a continuum limit is simple as long as the 
atomic data vary on the macroscopic scale only, see (1.9) below. If, however, 
the atomic data oscillate on the microscopic scale, the problem is tremen- 
dously complicated, because then distances and velocities do not converge 
to macroscopic functions. Instead, at each point in the macroscopic space- 
time, the local (r, v)-distribution converges to a nontrivial Young-measures, 
see Sect. 5. We interpret the microscopic oscillations as a form of temperature 
in the chain, see Sect. 2, and refer to oscillation-free limits as cold. 

The main problem in the case of temperature is to find an appropriate 
description for the structure and evolution of the oscillations. Even if we are 
interested in the macroscopic behavior of averaged quantities only, the mi- 
croscopic oscillations determine the evolution of the internal energy, that is, 
the amount of energy which is stored in purely microscopic motion. In other 
words, any reasonable macroscopic limit for oscillatory solutions needs to de- 
scribe thermodynamic effects, such as creation of temperature and transport 
of heat. Unfortunately, no rigorous theory is known that applies without fur- 
ther assumptions. 

Numerical simulations as discussed in Sect. 5 and Sect. 6, as well as rigor- 
ous results for the Toda chain, cf. [HFM81, Kam91, VDO91, DK*96, DM98}, 
suggest that for certain solutions of (1.1), the arising microscopic oscillations 
can be described by modulated traveling waves. Traveling waves are highly os- 
cillatory exact solutions of (1.1). The most relevant kind for our purposes are 
wave trains which are periodic functions of a single phase variable, depending 
on four characteristic parameters. Modulated wave trains arise when these 
parameters vary on the macroscopic scale. 
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A characteristic property of wave trains is that the arising Young measures 
are supported on closed curves. As a consequence, they correspond to a very 
special kind of temperature which is not related to our usual notion of ther- 
malization. However, they give rise to a thermodynamically consistent macro- 
scopic theory involving temperature, entropy, and so on. Moreover, if cold 
initial data form macroscopic shocks, then Newton’s equations self-organize 
into microscopic oscillations in form of modulated wave trains, and in this 
sense our notion of temperature turns out to be generic. 


Some aspects of the problems addressed in this article have much in com- 
mon with certain zero dispersion limits, which we will briefly discuss next 
to illustrate our point of view. This is not to be confused with so called zero 
diffusive-dispersive limits, where diffusive effects prevail, cf. e.g. [Sch82, KL02]. 
The most prominent example is Burger’s equation 


Ozut+udzgu= 0, (1.4) 
which, on a formal level, is the zero dispersion limit of the KdV equation 


Ozutudgut+edsu=0. (1.5) 





The main question is under which conditions the solutions of (1.5) converge as 
E — 0 to (weak) solutions of (1.4). The rigorous theory for this problem was 
developed by Lax and Levermore in [LL83, Ven85] by relying on the complete 
integrability of (1.5). 

It is well known that generic initial data uini yield a critical time terit such 
that (1.4) has a unique smooth solution for 0 < t < terit, but for È > terit 
solutions exist in a weak sense only, having at least one discontinuity at some 
Gerit, and satisfying Ozu + 40m (u?) = 0 in the sense of distributions. 

Imposing the same initial datum Uini for KdV, the typical behavior for 
€ — 0 is as follows, see for instance the surveys in [Lax86, Lax91, LLV93]. For 
0 < T< ferit the solutions uz of (1.5) converge strongly to the unique smooth 
solution of (1.4). However, for € > terit the KdV-solutions become highly 
oscillatory in a neighborhood of @rit with wavelength 1/,/e, and converge to 
a weak limit (u) only. The main point is that (u) does not satisfy Burgers 
equation, i.e. 07 (u) + 407 lu)? £0, because (u)? £ (u?). 

A discrete zero dispersion limit was studied in [GL88], replacing (1.4) by 


tha + 4 Ua (ua+ı — Ua-1) = 0. (1.6) 


This scheme is equivalent to a dispersive spatial discretization of (1.4), because 
the identification ua(t) = u(et, ea) transforms (1.6) into 0;u + uV**u = 0 
with V**u(#, a) = (u(t, a +e) — u(t, @—¢))/2e, The numerical study in 
[GL88] provides evidences for the same qualitative limiting behavior as for 
KdV. Further examples for numerical schemes with interesting zero dispersion 
limit can be found in [HL91] and [LL96]. 
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Towards modulation theory, [GL88] found a simple description for mod- 
ulated binary oscillations, which provides an approximate solution of (1.6) 
satisfying Ua © Va + (—1)° wa, where va and Wa vary on the macroscopic 
scale only. The modulation equations for binary oscillations read 


O7a+a0qb=0, 07b+b0qa=0. (1.7) 


where a = v + w and b = v — w. This system is strictly hyperbolic if and only 
if a and b have the same sign, and conservation laws for lna and lnb imply 
that strictly positive initial data stay positive for all times. 

Let a = a(t, Œ) and b = b(t, Œ) be a smooth solution of (1.7) defined until 
terit, and denote the corresponding modulated binary oscillations by 


umed(t) = v(et, ca) + (—1)* w(et, ca). (1.8) 


It is proven in [GL88] that (1.8) indeed yields approximate solutions of the 
microscopic system for t < e7 !terit in the sense that u™°4(t) — ua (t) converges 
to zero as £ > 0 for t < Terit if ua(0) = u®°4(0). For larger times we expect 


that modulated binary oscillations are not longer close to an exact solution. 


Returning to the atomic chain (1.1), we next derive the macroscopic evo- 
lution of cold data, i.e. we assume macroscopic fields r(t, œ) and v(t, œ) such 
that ra(t) = r(et, ca), va(t) = v(et, ea). Substitution into (1.3) yields 


ôr- Vtv = 0, O;u—V-*S'(r) = 0, (1.9) 


In the limit € — 0 we formally obtain the so called p-system consisting of the 
macroscopic conservation laws for mass and momentum given by 


Ozr—Oqv=0, Ozv—dqH'(r) =0. (1.10) 


It is well known that, for convex $, the p-system is hyperbolic and that for 
smooth solutions the energy is conserved according to 


dz (Su? + 8(r)) — Og (v P'(r)) =0. (1.11) 


In analogy to the previous discussion, the p-system describes the thermody- 
namic limit for cold atomic data as long as these data are smooth on the 
macroscopic scale. However, we will show next that, if the nonlinearity forms 
a shock, then the p-system is no longer a thermodynamically consistent model 
for the macroscopic evolution. For simplicity, we assume that the flux func- 
tion ©’ is convex so that all eigenvalues of (1.10) are genuinely nonlinear. 
According to the Lax theory of hyperbolic system, cf. [Smo94, Daf00, LeF 02], 
a shock wave propagates with a constant shock speed c so that r and v satisfy 
the Rankine-Hugoniot jump conditions across the shock given by 


—e[r] — [v] =9, ——ellv] — [$ (r)] = 0, (1.12) 
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where ||- || denotes the jump. However, (1.12) implies that the jump condition 
for the energy must be violated, i.e. for shocks with (1.12) we have 


-ejk + (r) — Her] #0. 


Consequently, the p-system predicts some production for the macroscopic en- 
ergy (the Lax criterion selects only shocks with negative production). 

In contrast, Newton’s equations always conserve mass, momentum and 
energy. Therefore, the p-system cannot describe the thermodynamic limit be- 
yond the shock at which the atomic data start to oscillate. Indeed, some 
amount of energy is dissipated into internal energy leading to a dispersive 
shock fan. It is one of the merits of modulation theory that it can describe the 
microscopic oscillations emerging from cold shocks as discussed in Sect. 6. 


The article is organized as follows. In Sect. 2: we briefly sketch the thermo- 
dynamical framework. We survey some existence and approximation results 
of wave trains and solitons in Sect. 3, including multi-phase wave trains, 
thermodynamic properties and new a priori estimates. Section 4 gives a brief 
overview on Whitham’s modulation theory applied to FPU chains, leading 
to a system of four conservation laws for wave train parameters. In Sect. 5 
we briefly summarize some aspects of numerical justification by evaluating 
the aforementioned Young measures and testing assumptions of modulation 
theory. The shock problem for cold Riemann data is discussed in Sect. 6, and 
we characterize the behavior of all macroscopic fields at the shock front by 
assuming that is consists of a generic soliton. 


2 Thermodynamic framework 


Thermodynamics describes the evolution of deformation and heat on the 
macroscopic scale in a body, which may be isolated from the surroundings 
or is subjected to external supply of mechanical forces and heat. In the fol- 
lowing we will illustrate the strategy of thermodynamics for a macroscopic 
body in one space dimension, that is microscopically constituted by an atomic 
chain. To this end thermodynamics considers, at any Lagrangian space-time 
point (t,ā) a certain number of specific densities u;(t,ā), j = 1...M, and 
determines these fields by means of a system of PDEs for given initial and 
boundary data. The most important densities in 1D are the specific volume 
(mean distance) r, the macroscopic velocity v, and the specific total energy 
E = ty? + U, uniquely decomposed into kinetic energy and specific internal 
energy U. 


The PDE system relies on M equations of balance that read in regular 
points 
zuj + zF; = Pj, j € {1,2,..., M}, (2.1) 
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where fj and Pj are called flures and productions, respectively. The funda- 
mental balance equations are the conservation laws for mass, momentum and 
energy given by 


Oz7r—-Oqv=0, O7v+O0qp=0, O7E+0cf =), (2.2) 


where p denotes the pressure and f is the energy flux, satisfying f = pv + q 
with heat flux q. Further conservation laws are possible, but those are material 
and process dependent. 


In order that (2.1) becomes a closed system for the variables, thermody- 
namics has to model constitutive equations that relate, in a material depen- 
dent manner, the fluxes and productions to the densities themselves and/or 
their time and space derivatives. The generality of the constitutive functions 
is restricted by universal principles like Galileian invariance and the entropy 
principle, and by material dependent symmetry principles. The entropy prin- 
ciple consists of several parts, see [MR98] for more details. 


1. There exists an entropy pair (S, g), given by (material dependent) consti- 
tutive functions in terms of the densities uj, so that the entropy density 
S is a concave function. 

2. The constitutive laws closing (2.1) yield a further balance equation 


d;8+0ag=5>0 with Y=0<>5P,=0, (2.3) 


where X denotes the non-negative entropy production. 
3. The definition of (absolute) temperature Tmacro is given by 


OU 
de ch (2.4) 


Note that this phenomenological definition is a priori unrelated to any 
microscopic model. 
4. The law of Clausius-Duhem holds, i.e. Tmacrog = 4. 


This abstract framework is the basic paradigm of Rational Thermodynam- 
ics and assumed to hold in all cases. However, the constitutive laws depend on 
the chosen macroscopic scaling and are generally unknown. Nevertheless, the 
scaling predicts the way in which the fluxes (and productions) can and cannot 
depend on the densities. For instance, in the hyperbolic scaling, the macro- 
scopic equations must be invariant (to leading order) under (t, @) + (At, AQ), 
whereas the parabolic scaling (t, @) = (€?t, ea) implies macroscopic invariance 
under (t, @) +> (A7#, AQ). 

Therefore, in the hyperbolic scaling all constitutive relations for the fluxes 
must be local, ie. Fj depends pointwise on the densities uj, so that (2.2) is 
a first order system. We mention that hyperbolicity of (2.1) is guaranteed if 
the entropy S is a concave function, see [MR98]. Generally, for the hyperbolic 
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scaling, all constitutive relations can be encoded in a Gibbs equation with 
a single thermodynamic potential. On the other hand, for parabolic scaling, 
we expect that the fluxes depend (mainly) on the spatial derivatives of the 
densities. In the simplest case the energy flux f is given by Fourier’s law, i.e. 
f = aU, so that the energy balance leads to the heat equation. 

In conclusion, we sketch the macroscopic thermodynamics for the atomic 
chain (1.1) as it results from modulated wave trains. It turns out that the 
macroscopic system (2.1) consists of the three fundamental and a fourth equa- 
tion, the conservation of wave number 07k — Oqw = 0, with wave number k 
and frequency w. In addition, there is a fifth conservation law for the entropy 
S, i.e. X = 0 in (2.3), and all fluxes are given by the thermodynamic potential 
U = U(r, k, S) through the Gibbs equation 


dE =wdS — pdr — gdk + v dv. (2.5) 


Note that the equation of state depends on the chain, i.e. on the potential $, 
whereas (2.5) is universal. From (2.4) and (2.5), we infer that the macroscopic 
temperature Tmacro equals the wave train frequency w. Interestingly, here there 
is a difference between Tmacro and the kinetic temperature defined as the 
mean kinetic energy of the atoms, see Sect. 3. However, it turns out that the 
Clausius-Duhem law is satisfied, i.e. we find q = wg. 


3 Traveling waves 


Traveling waves are exact solutions of the infinite chain (1.1) for N = oo of the 
form xq(t) = z(a — ct) depending on a single phase variable ¢ and traveling 
with a constant speed c. In the context of the macroscopic limits that we 
consider, relevant traveling waves are solitons, which vanish as ¢ — +oo, and 
wave trains, which are periodic in ¢. Due to Galilean invariance, we can allow 
additional drift in space-time of the form 





Talt) = ra + vt + Yalt), 
where the profile ya(t) solves the modified lattice equations 
Üa = P(r + Yatı — Ya) — P(r + Ya — Yo-1) (3.1) 


and traveling waves y(t) = Y(a — ct) solve the second order advance-delay 
differential equation 


asl) = G(r + ¥(¢ +1) — Y(¢)) -P(r +Y) - Y- 1)) (3.2) 


3.1 Wave trains 


Normalizing the period of wave trains to 1 and using c = w/k with wave 
number k and frequency w, we obtain the form 
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Talt) = ra + vt + X(ka + wt), (3.3) 


where X(vy) is the 1-periodic wave profile function. There are unique choices 
for the average distance r and the average velocity v such that J X(y)dọ = 0. 
Upon substitution into Newton’s equations, we obtain the analogue of (3.2) 


Ww OppX = P'(r + XY + k) —X(y)) =- P(r + X(~)- X(p—k)), (3-4) 
with the three parameters r, k, w. Another useful formulation is the fixed 
point equation, or nonlinear eigenvalue problem, for V = 0,X 


wV = F(V) := A,ae (r ob ÂV) l (3.5) 


where the operator Aj, and the Nemyckii operator O® are defined by 
p+k/2 


(ÂV) (e) := AV (p) — k f V(s)ds, AxV(y) := / ss V(r)dr 


IP(V)(y) = P (V(p)). 


Distances and velocities of the microscopic wave trains are then 


Talt) =r + AkV(ka+wt+k/2), Valt) =v+wV(ka + wt). (3.6) 


Existence and approximation of wave trains 


We give an overview of the variational approach to wave train existence and 
approximation by numerical schemes that are based on maximizing 


W(¥) = [ 20+ 4) dy, V € Hy := fv € L°([0,1]) : Tvi- < r}. 


Problem 3.1. For given parameters r, k and y > 0 we seek maximizers of W 
in H}, i.e. we solve W(r, k, y) = max wv(y). 
eH, 


Theorem 3.2. Problem 3.1 always has a solution. In particular, there exists a 
maximizer Y with || V||,2= v27 together with a positive Lagrangian multiplier 
w? > 0 such that Y and w? solve (3.5). 


Scheme 3.3. Let any parameter set for problem 3.1 be given, and let Vo € Hy 
be an arbitrary initial datum with AyVo 4 0. Then we define inductively two 
sequences (Vn) nen C Hy and (wn) nen by the following iteration step 


= 1 = [Wall = 
Vani = Wors Wr=FVn, n= JA Wn+1 = V fn- 


In [Her04] it is proved that this scheme is compact, and numerical simulations 
indicate that Scheme 3.3 converges. 
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Remark 3.4. In fact, Theorem 3.3.2 in [Her04] proves that every closed cone 
of functions that is invariant under F contains at least one traveling wave. 
For the cone of even monotone functions used below, this was also shown in 
[FV99], Theorem 2.14. 


By means of Scheme 3.3 we can compute wave trains with prescribed 
parameter y = $ || V||72. There are variants of 3.3 which allow to prescribe 
either the entropy S or the temperature T of a wave train (for the definition 
of S and T see Sect. 3.1 below). Hence, wave trains are parametrized (at least) 
by (k,r,y), as well as trivially by v; the latter is relevant for the modulation 
equations discussed in Sect. 4. On may view the parameter w of the wave 
train equation (3.4) depending on (r,k,7) via a ‘dispersion relation’, here 
expressed as the Lagrange multiplier. We emphasize, that it is not known 
whether the set of wave trains is a smooth three-dimensional manifold of 
orbits; note that phase shifts V(. + s) trivially give rise to an (at least) one- 
dimensional kernel of the linearization w? — DF (V) of (3.5) spanned by ôV. 
Moreover, for given parameters there is a discrete multiplicity of solutions, 
because solutions for mk, m € N, are solutions for k as well, though these 
do not have minimal period 1. We conjecture that wave trains are unique in 
cones defined by monotonicity properties of V as discussed below. 


Existence and approximation of multi-phase wave trains 


We present new results concerning the existence of multi-phase waves, which 
will be published with full details elsewhere. As before, our variational ap- 
proach is essentially restricted to convex interaction potentials , but allows 
for arbitrary large amplitudes. 

For simplicity we consider only two-phase wave trains having two wave 
numbers kı and k2. However, all results can easily be generalized to other 
multi-phase wave trains. Moreover, to avoid technicalities we always suppose 
that ® is defined on the whole real axis with bounded and continuous second 
derivative &”. 

A two-phase wave train is an exact solution of Newton’s equations satis- 


fying 
Talt) = ra + vt + X(kia + wit, kaa + wet). (3.7) 


Here r, v, kı, kg, w1 and w2 are given parameters, and the profile function X 
is assumed to have zero average and be 1-periodic with respect to each phase 
variable y; = kia + wit. The ansatz (3.7) gives rise to the advance-delay 
differential equation 


(w 82, + w5 02,)X = V_OG(r + VFX) (3.8) 





where V= are difference operators defined by 


(V*X) ($1, p2) == £X(y1 + kı, p2 + k2) F X(%1, p2) 
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Our aim is to identify an optimization problem with a single scalar constraint 
such that (3.8) is equivalent to the corresponding Euler-Lagrange equation 
with multiplier w?. Consequently, we regard the ratio 8 = w2/w as parameter 
of this problem. 


Let 72 S [0, 1] x [0, 1] be the two dimensional torus with its canonic Lebesgue 
measure, and let all function spaces which follow be defined on 73. We consider 
the Sobolev space 


aj={xem': f x=0}, IX ye= f 0a +5" (O,,X)*. (3.9) 


This norm is equivalent to the standard norm on Hj as long as 8 # 0. Let 
E be the canonic embedding E : Hj — L?, and E* its adjoint operator E* : 
PoHot= (H4). Note that E is compact, and that here we have identified 
L? with its dual L?*. By A we denote the Laplace operator corresponding to 
(3.9), i.e. 


A= 8, + 68, 


Recall that —A : Hj — H~! is an isometric isomorphism between Hilbert 
spaces, and that the difference operators V+ : L? — L? are continuous with 
(V+)" = —V~-. Moreover, our assumptions on ® imply that the convex func- 
tional X > fz, ®(X) is well defined and continuous on L?. 

The spaces and operators from above allow to regard the wave train equa- 
tion as an equation in H~!. In particular, (3.8) is equivalent to 





-w AX = E*(Vt)" 08(r + VT EX), (3.10) 


where the Nemyckii operator 0@ : L? — L? with X + '(X) is the Gateaux 
differential of the functional (3.11). For fixed y > 0 we define the closed convex 
set H, € L? and the convex Gateaux differentiable functional W as follows 


H, = {x eM: tX 7} , W(X) := | S(V*EX). (3.11) 
Ta 
Now (3.10) yields the following constrained optimization problem. 


Problem 3.5. For given parameters r, kı, k2, y > 0 and 8 Æ 0 we seek 
maximizers of W in H}, i.e. we solve 
k = X). 
W(r, ki, k2, B, 7) a (X) 
Theorem 3.6. Problem 3.5 always has a solution. In particular, there exists a 
maximizer X with || X||qi= v27 together with a positive Lagrangian multiplier 
w? > 0 such that X and w? solve (3.10). 
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Remark 3.7. By construction, (3.10) is an identity in H~'. However, X € H! 
implies that the right hand side of (3.10) is again an element of H!, and the 
theory of elliptic regularity provides X € H?. Moreover, we can prove further 
regularity by exploiting Sobolev’s embedding theorems. 


In analogy to the single-phase wave trains, we can solve the optimization 
problem 3.5 iteratively using an adapted abstract approximation scheme. 


Geometry and phase velocity of wave trains 


Since the modulation equations for the macroscopic limit of the chain depend 
on wave trains, it is essential to understand properties of wave trains and their 
parameter variation. Motivated by numerical simulations, we investigate geo- 
metric properties of wave trains in the phase space of distances and velocities. 
With the shock problem in mind, see Sect. 6, we are also interested in the 
transition to solitons as the wave number tends to zero. 

From (3.5) we infer that if (V, w, k, r) is a solution to (3.5), then (—V,w, 1— 
k,r) is also a solution, and vice versa: V(y; k) = —V(y + 1/2; 1 -— k). In case 
of a binary oscillation, k = 1/2, the symmetry implies that (3.4) reduces to 
the planar Hamiltonian ODE 


wOopX = P' (r — 2X) — P' (r + 2X). (3.12) 


More generally, for rational k = n/m equation (3.4) can be written as an 
m-dimensional second order Hamiltonian ODE with components X; = X(- + 
jn/m), j = 0,...m — 1. This system is equivariant under the Zm action 
X; — Xj41, where indices are taken modulo m, and Zm lies in the isotropy 
subgroup of wave trains. 

The microscopic phase space of distances and velocities is in fact the phase 
space of the ODE (3.12) for k = 1/2. Therefore, the orbits 


Q := {(r + AV (y t+ k/2),v+wV(y))|0< ye <1} (3.13) 


are convex, non self-intersecting curves and nested for different w with fixed 
(r, k). 

We can prove some of these properties for general wave number k, see also 
Fig. 5.4, and define the positive cones 





Ma := {V1 +9) = ay )ds = 0, V(y) = V(—9), 


sgn(V(y1) — V(p2)) = 1, 0 < p1 < p2 < 1/2}, 





so that W € M+ has unique global extrema at y = 0 and y = 1/2, and 
W(y1) = W(v2) is equivalent to yi € {p2, —p2, 1 — y2}. By symmetry W € 
Mz, implies W(- + 1/2) € Mz. The basis of the following results is the 
observation F : M+ — M+, which was noted in [FV99]. Throughout this 
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article, we are only interested in wave trains in M4, U M_, and conjecture 
that wave trains are unique within these cones. 

Let Qo = {@"(r + ArV(y~)) | 0 < p < 1}. We will estimate the phase 
velocity Cph := w/k of wave trains and the size of Q in terms of 


M :=maxQa(V), m:=minQa(V). 


Remark 3.8. Note that ym, VM are the characteristic velocities of the p- 
system. Applying Theorem 3.10 below for monotone ®”, these values are at- 
tained at y = 0,1/2, respectively. 


Next, we report our main, new results concerning the general geometry 
and phase velocity of wave trains; full proofs will appear elsewhere. 


Theorem 3.9. Assume P” > 0. Then (8.5) has solutions (w,V) € Ro x M+ 
for any (y,7,k) € R? x (0,1/2) such that k — 0 & w — 0. 
More precisely, the phase velocity of these wave trains satisfies 
vam < H < vi, (3.14) 
where b(k) € (0,1/2), b(k) = 1/2 for k < 2p, with V(y.) = 0 the unique root 
of Y. For 2p, < k < 1/2, we can take 
E4 VA) + [Ve + 1/2) 7d 
1 Jena VP + IV + 1/2)Pde 
a | | 
IV lo 
Theorem 3.9 states that the lower phase velocity bound of wave trains is 
estimated by a correction of the p-system characteristic. Indeed, small am- 
plitude wave trains have m ~ ®"(r), so that the harmonic phase velocities 
J/msin(rk)/rk apply, which, being smaller than ./m, necessitate a correction 
such as b(k). 


Theorem 3.10. Assume P” > 0 and consider solutions V E€ M, to (8.5) 
for given (r,k,y). Then the curve Q is smooth, closed, convex and non self- 
intersecting. Its unique extrema in r-direction lie at p = —k/2,(1—k)/2 and 
in v-direction at p = 0,1/2, and it is bounded by |w| < kVM and 


SRIV(y)| < AVO] < VEINI: (3.15) 


If ®” is monotone, and w independent of (r, k), then w is a strictly monotone 
function of y and the curves Q are nested near the extrema in r- and v- 
directions for fixed (r, k). The sign of monotonicity is that of w. 


While the unique points on Q with vertical and horizontal slope lie at 
yp = (1 —k)/2,1/2 and y = 0,—k/2, the limiting profile for k — 0 is not 
necessarily parametrized by y. Indeed, for a limiting soliton, we expect that 
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only one of these pairs converges to the point (r, v) of the soliton’s background 
state as k — 0. 

The harmonic dispersion relation (3.18) renders w a function of k, so that 
the last part of Theorem 3.10 does not apply. Indeed, in this case V is inde- 
pendent of k and w, see (3.17). 


Remark 3.11. The estimates on the size of Q imply that a nontrivial solitary 
limit as k — 0 requires unbounded norm parameter y, growing at least like 
1/k. Since Theorem 3.9 also implies w — 0 as k — 0, we expect that the 
monotonicity of y in w holds in general for small w. 


Nestedness of Q near the extrema in r and v directions for fixed (r, k) is a 
biproduct of our approach. However, it seems difficult to prove the numerical 
observation that the entire phase plots are nested. 


Thermodynamics of wave trains 


Wave trains represent exact solutions of Newton’s equations which are highly 
oscillating on the microscopic scale. However, on the macroscopic scale we 
cannot resolve the microscopic oscillations but must pass to a thermodynamic 
description involving energy, pressure, temperature and the like. It turns out 
that for each wave train all these thermodynamic quantities are constant on 
the macroscopic scale. As a consequence, we can interpret each exact wave 
train solution of Newton’s equation as a “thermodynamic state” of the chain. 
This idea turns out to be fruitful in modulation theory discussed in Sect. 4, 
where we allow for macroscopic modulations of the thermodynamic states. 

Most thermodynamic quantities are defined as mean values of the oscillat- 
ing atomic data in a wave train solution: 


1 
W = t @(r+ AkV(p))dy specific internal potential energy density, 
0 

1 

p= f P'(r + AxV(y)) dy pressure = negative specific force density, 
0 
w? f 

K= T | Ví)? dy specific internal kinetic energy density, 

0 


and further 


T =2K kinetic temperature, 
F=kK-—W = specific internal action density, 
U=k+W = specific internal energy density, 


1 
E = ~ +U specific total energy density. 


Note that K = w?y, where y is norm parameter used above. All these quanti- 
ties are constants for exact wave trains. However, in modulation theory they 
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become fields in ¢ and @ whose macroscopic evolution is described by the 
modulation equations. In particular, although all quantities are defined by in- 
tegrals over the phase variable y, in modulation theory they represent specific 
densities. 

There are other important thermodynamic quantities which are not di- 
rectly related to mean values of the atomic data. It turns out that 


Sis J VG) doy. e f V(y)#'(r + A,V(y)) dy (3816) 


can be interpreted as the macroscopic entropy density and entropy flux, respec- 
tively. It is proven in [Her04, DHMO06] that any smooth family of wave trains 
provides an equation of state together with a corresponding Gibbs equation. 


independent thermodynamic Gibbs equation 


variables potential 

(r, k, y) W =W(r, k, y) dW = w?dy — pdr — gdk 
(r, k, w) F= F(r,k,w) dF=Sdw + pdr + gdk 
(r, k, S) U =U(r, k, S) dU =wdS — pdr — gdk 


The different variants of equations of state and Gibbs equations are all equiv- 
alent as long as the respective changes of coordinates are well defined. 

The Gibbs equation becomes very important in modulation theory, where 
it provides the closure for the modulation equations. In particular, if the 
equation of state is known, then all other constitutive relations are determined 
by the Gibbs equation. 


Examples for wave trains 


For a few specific potential, explicit expressions are known for both the profile 
functions and the equation of state. The following examples are taken from 
[Her04, DHMO06]. 

The harmonic chain with interaction potential P(r) = co + cır + 2 rĉ. 
Here the linearity of 8’ implies that (3.5) may be solved by means of Fourier 
transform. Some simple calculations yield the following family of traveling 
waves, parameterized by (r, k, y), 


Viv) = 2/7 sin (27), AgV(yp +k/2) = (sin (r k)/r)V(). (3.17) 


Here V is independent of (r, k) and A,V independent of r. Degeneracy of the 
harmonic chain is also reflected by the harmonic dispersion relation 


w(k) = yczsin (nrk) /r, (3.18) 


which provides the frequency w as function of k, and does not depend on r or 
y. Consequently, for the harmonic chain we cannot choose r, k and w as set of 
independent variables. From (3.17) we infer that the equation of state reads 
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1 2 2 
W(r, k, y) =o+tar+z er +w(k) y, 


which implies g(r, k, y) = —c2 sin (2rk)y and S(r, k, y) = 2w(k)y. Moreover, 
we can replace y by S, and obtain 


1 
U(r, k, S) = co + cr + sor + w(k)S. (3.19) 


The hard sphere model with interaction radius ro. Here all atomic inter- 
action are modeled as elastic collision between hard spheres with radius ro. 
This gives rise to an interaction potential @ with 


P(r) = +œ for r<ro, Pr) =0 for r> ro. 


Although this potential is not smooth the notion of traveling waves may be 
generalized to this case, and again we are able to derive explicit expressions 
for wave trains. Some basic arguments lead to the following family of traveling 
waves, parameterized by (r, k, w), 


—(r — ro)/k if O<y<k, 
V(r, k, Deg een er if a 


Note that here the frequency w > 0 is a free parameter and may be chosen 
independently of r and k. The corresponding equation of state reads 


U(r, k, S) = 4 (r — ro)? S? k (1 — k). (3.20) 


We mention that the hard sphere model describes the high energy limit for 
certain potentials, see [Tod81] for the Toda potential, and [FM02], as well as 
Sect. 3.2, for Lennard-Jones potentials. 

The third example is the small amplitude limit, where the amplitude 
ô of V is defined as the first fourier coefficient, i.e. for odd V we find 
6= h V(w) cos (27y) dy. To identify the leading order terms we expand 
the nonlinear interaction potential ® around the mean distance r up to fourth 
order. To leading order the frequency w must satisfy the harmonic disper- 
sion relation which now reads Qo(r, k) = ./®"(r)sin (k7)/7. According to 
[DHMO06], the amplitude ô and the action F can be expressed in powers of 
w — Qo(r, k) as follows 


w? — lr, k)’ 
2 olr, k)? 
F(r, k, w) = —®(r) + G(r, k)(w — Qo(r, k))? + O((w — Qo(r, k))®), (3-21) 


= G(r, k), 


where G(r, k) is given by 


p” (r)? 
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3.2 Solitary waves 


Homoclinic orbits in ODEs are typically accompanied by large wave length 
periodic orbits in the sense that there exists a parameter curve of periodic 
orbits converging to the homoclinic orbit as the period tends to infinity [VF 92]. 
For the lattices we consider, the situation is similar: wave trains exist for 
arbitrarily larger wave number and limit to solitons as the wave number tends 
to zero. This was proven for certain monotone waves and potentials under 
growth assumptions in [PP00] by a mountain pass approach. 

We thank Karsten Matthies (University of Bath) for providing notes on 
which the remaining part of this section is based. We report some of his joint 
work with Gero Friesecke, mainly concerning solitons in (1.1) in the form (3.1) 
for a large class of possibly non-convex potentials ©. 

A prototype of physically realistic interaction is given by the standard 
Lennard-Jones potentials 


2 
S(r) = a(r-™ — ro) forr>0, a>0, mEN. 


where ® is minimized when neighbouring particles are placed at some spe- 
cific equilibrium distance r = rą > 0, and tends to infinity as the neighbour 
distance tends to zero. 

Since the particle positions £a corresponding to displacements Ya are £a = 
Txa + Ya, this means that (r) must have a minimum at r = r, and that 
(r) — co as r > 0. More precisely we assume: 


(H1) (Minimum at r,) & € C(0, 00), P(r) = G(r.) = 0, P(r) > 0 
(H2) (Growth) @(r) > cor—! for some co > 0 and all r close to 0 

and (r) = œ forr <0. 
(H3) (Hardening) &”’(r) < 0 in (0, r4], (rn. +r) < Bra — r) in (0,1). 


Here we seek solitons whose profile Y(¢) solves (3.2) with r = rą. The con- 
struction in [FW94] for the existence of solitons is based on the variational 
problem 


1 
Minimize y,(Y) := al 3Y ($) de among Y € WẸ? (R) satisfying 
R 


3pY € L?(R), W.(Y) := i S(r + Y(¢ +1) — Y(¢))d¢. (3.22) 
R 

Remark 3.12. It is instructive to compare this ansatz with the one used for 
wave trains in Sect. 3.1, where the real line is replaced by the unit interval 
and W maximized for fixed norm parameter y. This lead to a relatively simple 
convex maximization problem for convex potentials. In contrast, (3.22) is a 
kind of dual problem, where W, is fixed and the norm parameter y, minimized; 
a more challenging formulation that allows for non-convex potentials. 
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The goal is to determine the I’-limit of the variational problem and the 
limiting profile in the high-energy regime. Since this regime is highly discrete 
and involves strong forces, neither classical continuum approximations nor 
weak coupling approximations are possible. 

The limiting profile for W,(Y) — oo was derived in [FM02]. Here we 
recover this as a corollary of the following I -convergence result. We let 


H* := {Y € W2?(R)|Y(0) = 0, 3pY € L?(R)}, 


and for every displacement profile Y we denote the relative displacement pro- 
file by r(¢) = Y(@ + 1) — Y(¢). As in (3.22) we consider the functional yx 


on 
Hy, = {Y € H*|W,(Y) = K} 
H% = {Y € H* |r(¢) 2 =r; VO € R 
J compact nonempty set Sy C R with rjg, = —r«}. 





Theorem 3.13. (T-convergence) Assume that the interaction potential satis- 
fies (H1), (H2). Then the problem ’Minimize y,.(Y) for Y € Hj,’ '-converges 
to the problem ’Minimize y.(Y) for Y € Hx,’ in the sense that 


1. (lim-inf-inequality) If Y — Y in H* with Y © Hy, Y translation 
normalized (i.e. r*)(0) = mingerr)(¢)), then Y € H*, and 7,(Y) < 
lim inf x> Jx(Y¥), 

2. (Existence of recovery sequence) For all Y € Hx, there exists a sequence 
YX) c Ht with YX) — Y in H* and y (YŒ) > 7.(¥). 


A consequence is the following piecewise linear asymptotic displacement pro- 
file, corresponding to piecewise constant velocity profile. 


Corollary 3.14. (Asymptotic shape of minimizers) Every translation nor- 
malized sequence YO of minimizers of yx on Hi, converges in H weakly 
to the up to translation unique minimizer Yæ of the limit problem, where 


0, <0 
Yo0(¢) = =O; O € (0, 1] 
Tx, oe 1. 


In a mechanical interpretation, this is a compression wave localized on a 
single atomic spacing. The limiting dynamics are hard-sphere dynamics like in 
a Boltzmann gas, see Fig. 3.1. In particular the work shows that dispersionless 
transport of energy is not restricted to the long-wave regime. 


We mention that Friesecke and Matthies analyse a two dimensional coun- 
terpart of (3.1) in [FM03], see Fig. 3.1. The existence of longitudinal solitary 
waves along one of the lattice directions was shown for typical potentials un- 
der some mild nondegeneracy assumptions. These traveling waves are unique, 
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Fig. 3.1. Left: Hard-sphere soliton. Right: cell of springs in 2D lattice. 


i.e. there are no other localized traveling wave in the same direction, e.g. there 
do not exist transversal traveling waves. It is surprising that purely harmonic 
springs are included here, because solitary waves do not occur in harmonic 
chains. 


4 Modulation Theory 


4.1 Macroscopic evolution of data with temperature 


In this section we use the theory of Young measures, see for instance [Tay96, 
War99, Daf00], and derive some restrictions for any thermodynamic limit of 
the chain. 

Let N = {(@, a) : 0 < T< tan, & € [0, 1]}, and let (Nj); be a sequence 
with N; — oo. Moreover, for any i let QQ (t) = (rË (t) vw) (t)), 0 < t < Niten 
and a = 1, ..., Nj, be a solution of Newton’s equation, and suppose that the 
total energy of the initial data is proportional to Nj, i.e. 


Ni 
1 ; 2 : 
> Go) + a(r®0)) = O(N;). (4.1) 
a=1 

Under some suitable assumptions on the potential (say boundedness of &” for 


simplicity) the functions Q® (t) are compact with respect to the convergence 
of Young measures in the following sense. There is at least a subsequence, still 
denoted by (N;);, and a family of probability measures (t, a) > u(t, a, dQ) 
such that for any continuous observable Yy = u(Q) = y(r, v) the following 
convergence is satisfied 


f P(QO (N, Nia) EE a) ddaa => | ()(E a) E(t, a) dtaa. 
Í d (4.2) 
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Here € is an smooth test function, and (4) (t, @) is given by 


(Ea) = | WO nE a, dQ). (4.3) 


For fixed (t, @) € Q, the probability measure u(t, @, dQ) describes the mi- 
croscopic oscillations in the vicinity of (t, a), and for any observable ¥ the 
number (W) (t, @) gives the local mean value of Y. 

Here we consider the common probability distribution of distance and 
velocity instead of their separate statistics so that any measure u(t, a, dQ) 
can be interpreted as a weight function defined on the microscopic state space 
which is spanned by distance and velocity. 

In Sect. 1 we have seen that Newton’s equations are equivalent to the 
two microscopic conservation laws (1.3), from which one can derive the mi- 
croscopic conservation of energy éa(t) = —fa(t) + fa+i(t) with ealt) = 
$0241 (t)+P(ra(t)) and falt) = —va(t) P (ra(t)). As a direct consequence, ev- 
ery Young measure limit must satisfy the following macroscopic conservation 
laws of mass, momentum and energy 


ar(0)~ Ba) = 


Pe ay ‘(r)) 
Oz (gv" + P(r) — Ix (v P'(r)) 


II 


0 
0, (4.4) 
0 


This system of PDEs gives some restrictions for any young measure limit of the 
atomic chain. However, in general we can not express the fluxes in terms of the 
densities, and hence the system (4.4) is not closed, i.e. it does not determine 
the macroscopic evolution completely. We mention that (4.4) shows that any 
Young measure limit is a measure-valued solution of the p-system in the sense 
of DiPerna, see [Hör97, Daf00]. In addition, it is a measure-valued solution of 
the energy equation (1.11). 

Within modulation theory we will start with some assumptions concerning 
the structure of the microscopic oscillations in the chain. Afterwards we will 
identify further macroscopic evolution laws extending (4.4), and constitutive 
relations that close the extended system. 


4.2 Whitham modulation equations for wave trains 


Here we describe Whitham’s modulation theory for the atomic chain with 
hyperbolic scaling. For further examples concerning modulation theories of 
discrete system we refer to [HLM94, SW00, FP99, DK00, GM04, GMO06], and 
to [GHM06a, GHMO6b] for an overview. 

A modulated traveling wave is an approximate solution of Newton’s equa- 
tion (1.1) satisfying 


> 1 
Talt) = =X(¢t ca) +i(ot EQ, = Olet, ca), (4.5) 
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where X and O are macroscopic functions. The generic traveling wave param- 
eters (r, v, k, w) now are macroscopic fields depending on (t, a), and read 


w = 070, k = ôz O, v = ð; X r=O0qX. (4.6) 
The microscopic oscillations are described by 
X(t, T, y) = X(r(t, T), v(t, a), k(t, 3), w(t, a), a(t, a), p), (4.7) 


where X(r, v, k, w, a, p) is a smooth family of 1-periodic wave trains depend- 
ing on the parameters u = (r, v, k, w, a) as well as on the phase y. We use 
an additional parameter a, which might be the entropy S or the parameter y. 
However, in any case we impose an abstract dispersion relation 


w = N(r, k, a). (4.8) 


The modulation equations are PDEs which describe the macroscopic evolu- 
tion of the modulated parameter (r, v, k, w, a), and ensure that (4.5) indeed 
provides approximate solutions. For their formal derivation we use Whitham’s 
variational approach, see [Whi74, FV99, DHMO6], and [GHM06a, GHMO06b] 
for a more general setting. 

In a first step we insert the ansatz (4.5) into the expression for the total 
action in the atomic chain, and expand all arising terms in powers of ¢. This 
gives rise to the reduced action integral 


thin 1 
total action = L(X, O, a) = J Jeeg a)) dade, (4.9) 
0 0 


with L(u) = L(u, X(u, -)) and 


C(u, X) = T (50 + wOpX)? — Br + vx) do 


where (VX) (y) = X(p + k). In a second step we apply the principle of least 
action to (4.9). The variation with respect to a gives 0,L = 0, which recovers 
the dispersion relation (4.8), and the variations with respect to X and O yield 


ðzðL+37əð,L=0 and 070,L+0c%L =0, (4.10) 


respectively. Moreover, the definitions (4.6) imply two further evolution equa- 
tions, namely Orr — Ogu = 0 and 07k — Ogw = 0. 

In the last step we reformulate all macroscopic identities by using the 
thermodynamic definitions from Sect. 3, and as a consequence we find that 
the modulation equations take the form 


Oz (r, v, k, 5) +z (-». +p, —w, +9) =0. (4.11) 
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These equations represent the macroscopic conservation laws for mass, mo- 
mentum, wave number and entropy. Moreover, they imply the conservation of 
energy via 


zE + Ox (pv + gw) =0. (4.12) 


and thus we can regard the system (4.11) as an extension of (4.4). Recall 
that the closure for (4.11) and (4.12) is provided by the equation of state 
E = $v? + U(r, k, S) and the Gibbs equation (2.5). However, for almost all 
interaction potential ® we lack explicit expressions for the equations of state, 
and therefore we cannot characterize the properties of (4.11). 

Finally, we display the modulation equations for the harmonic chain 


Oz (r, v, k, s) — ôx (v, cor, w(k), w (k) s) =0, (4.13) 


which follow from (4.11) by means of the equation of state (3.19), and the 
harmonic dispersion relation (3.18). 


4.3 The justification problem 


So far, there is no known rigorous derivation of the modulation equations 
for the nonlinear case. For this reason we formulate a conjecture, following 
similar results for partial differential equations [KSM92, Sch98, Mie02]. We 
assume that the potential ® is sufficiently smooth, and that a smooth family 
of traveling waves X(u, y) with independent parameters u = (r, v, k, w) is 
given. Moreover, we assume that the following set M is open 
the system (4.11) is strictly hyperbolic in u, 
ae fu = tek) | the a ee X(u, -) is neal stable \ 


For a given solution ù = ù (t, @) of (4.11) we define 
F(et, ea) + (A,V) (ŭlet, Eq), 16(et, ea) + Lk(et, ea)) 


16 (et, ea)) ' 


a 


M(t) = O(et, ca) + O(et, ca) V(a(et, Ea), 


where V abbreviates 0,X, and the modulated phase ð is given by (4.6). We 
believe that the following conjecture is in the heart of the matter. 


Conjecture 4.1. Let u be a sufficiently smooth solution of Whitham’s equation 
defined for t € [0, tn], and suppose that ŭ takes values in M. Then there 
exists a suitable Banach space Y., and some exponent « > 0 such that 


IH — M lly, = 0"), 1R Olr, = OC) (4.14) 


for all €, and all t with 0 < et < tfn- 
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At the moment we are far from being able to prove this conjecture in this 
general form. However, it does hold rigorously for the harmonic chain and the 
hard sphere model. 

The proof for the harmonic chain essentially relies on the linearity of 
Newton’s equations, which allows to control the residuum, see [DHMO6]. 
In addition, there is further rigorous derivation of (4.11) in the context of 
Wigner measures. For the details we refer to [Mie06], and for similar results to 
[Mac02, Mac04]. The rigorous justification for the hard sphere model is based 
on the observation that both the microscopic dynamics and the macroscopic 
equations become much simpler in the Eulerian representation of thermody- 
namics, cf. [Her04] 

On a formal level we expect a close relation between stability of wave trains 
and hyperbolicity of modulation equations; if Whitham’s equations (4.11) are 
not hyperbolic, then the corresponding initial value problem is ill-posed, which 
indicates that traveling waves are unstable due to a Benjamin-Feir instability, 
see e.g. [Whi74, BM95]. However, for arbitrary interaction potential ®, nei- 
ther stability criterions nor hyperbolicity conditions are available up to now. 
Having linearly degenerate eigenvalues, the harmonic chain and hard sphere 
model are not prototypical and do not provide further insight. Only the small 
amplitude limit gives some criteria for the hyperbolicity of (4.11). Starting 
with the equation of state (3.21) we can compute the characteristic speeds 
for (4.11), see [DHM06], and end up with the following criterion. The system 
(4.11) has four real eigenvalues, and is thus hyperbolic, if 


N(r, k) =("(r)) 2 (7 — 8 cos (27k) + cos (4rk)) $ 


&" (r) B (r) (4 cos (27k) — 3 — cos (4nk)) 
is positive, but has two imaginary eigenvalues for N(r, k) < 0. For k = 1/2 
the corresponding formula was already given in [Fla96]. 


5 Numerical justification of modulation theory 


Although there is no rigorous justification for the modulation equations (4.11), 
numerical simulations strongly indicate that they provide the right thermo- 
dynamic description for a wide class of initial value problems for the atomic 
chain. We refer to [DH06] which gives a detailed thermodynamic interpreta- 
tion of several numerical experiments. The main results can be summarized 
as follows. 


(i) If all macroscopic fields are smooth, then the arising oscillations in the 
atomic data can be described in terms of modulated traveling waves, and 
the macroscopic dynamics is governed by the modulation system (4.11). 

(ii) Modulated traveling waves describe the microscopic oscillations emerging 
when cold data form shocks. 
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(iii) If the shocks emerge from data with temperature, then usually the mi- 
croscopic oscillations exhibit a more complicated structure, and (single- 
phase) modulation theory fails in this case. 


Concerning the last item, results for the Toda chain suggest a hierarchy of 
modulation models, enumerated by the number of phases, where shocks on a 
lower level require the model of the next level, see e.g. the review [LLV93] 
and the references therein. 

Note that these numerical observation are valid only if the interaction po- 
tential ® is convex, the macroscopic scale results from the hyperbolic scaling, 
and the microscopic initial data are given by modulated traveling waves. 

In this section we give an brief survey on the numerical justification from 
[DH06], and present a typical example with periodic boundary conditions 
and smooth macroscopic fields. Moreover, in Sect. 6 we study the numerical 
solution of a Riemann problem with cold initial data, and give an improved 
discussion of its macroscopic limit. 


In order to study the macroscopic behavior of the atomic chain for large N 
we must evaluate the thermodynamic properties of the numerical data which 
are the macroscopic fields of the local mean values, and the local distribution 
functions of the atomic data. The computation of both mean values and dis- 
tribution functions relies on mesoscopic space-time windows. In what follows 
let F = IF x if be such a window where Ir and Ip are sets of time steps 
and particle indices, respectively. The window F is mesoscopic if and only if 
it is very small on the macroscopic scale, but contains a lot of particles as 
well as time steps, i.e. Hr, tip ~ N“ for some exponent « with 0<« < 1. In 
particular, any F describes the microscopic vicinity of a certain macroscopic 
point ZF = (ts, ar). 

For any atomic observable ~ we can easily compute the mean value (4) + 
of ~ with respect to each window F by a simple averaging formula. Note that 
there is a close relation to the notion of Young measures. In particular, if the 
atomic data converge for N — oo in the sense of Young measures, then (w) + 
is a good approximation for (w)(t¢, @) from (4.3). Moreover, by means of 
F we can compute the complete measure u(tr, HF, dQ), see [DH06] for the 
details. 


The micro-macro transition of modulation theory relies on the hypothesis 
that all atomic oscillations can be described by modulated wave trains. If this 
is right, then the microscopic distributions functions within any space-time 
window F must be equivalent to an exact wave train. Of course, the parame- 
ters of this wave train may depend on £. In order to justify this hypothesis for 
given F, we have to identify four wave train parameters, namely the specific 
length rz, the mean velocity uz, the wave number kp and a fourth parameter 
which might be the parameter yz, the frequency wz, the entropy Sr, or the 
temperature TF. 
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The values of rz, ve and Tr are given by mean values of microscopic 
observables. This reads rz = (r)p, ur = (v)z, and TF = (v?), — (w) 
Determining k and w¢ is not so obvious, because they have no immediate 
physical interpretation on the microscopic scale. To overcome this problem we 
introduce auxiliary observables Wg and Y, see [DH06] for their definitions, and 


set 
kr := (he, wr = (Pop: 


In the next step we start a numerical scheme similar to (3.3), which allows to 
prescribe the values rf, vf, kf and Tr, see [DH05] for details, and compute 
an exact wave train with these parameters. For any F, the scheme yields a 
profile function Vp as well as a frequency wW which does not result from 
the auxiliary observable W%, but satisfies a dispersion relation. 

Finally, we compare the microscopic distribution functions from the nu- 
merical data with their macroscopic predictions which can be expressed in 
terms of Vpr. In particular, according to modulation theory, the support of 
the microscopic distribution functions must equal the curve 


p= Qt (Y) = (rx + ÂkV Fle + ke/2), ur + wr Vx(¥)) (5.1) 


This rather strong prediction can be check for given numerical data. 


Smoothly modulated initial data 
We study the evolution of data with temperature by imposing initial data in 
form of smoothly modulated binary oscillations, i.e. we set 

r°dd(ca) if a is odd, _ fv 4(eq) if a is odd, 


Ta (0) = one if a is even, Va (0) = ee) if a is even, 


where rodd, peven ytd and ve? may be read off from Fig. 5.1. We solved 
Newton’s equation for the Toda chain with N = 4000 up to the macroscopic 
time tgn = 0.4 by means of the Verlet scheme, see [SYS97, HLW02]. 
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Fig. 5.1. Snapshots of the atomic distances and velocities at several macroscopic 
times. The vertical lines at t = 0.4 mark the space-time windows for Fig. 5.4. 
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Fig. 5.2. Selected macroscopic fields as functions of @ for t = 0.4. 


Figure 5.1 contains snapshots of the atomic data for several macroscopic 
times, where the black colored curves represent the local mean values, and 
Figure 5.2 shows the profiles for some macroscopic fields at time t = 0.4. We 
observe that the atomic data are highly oscillating on the microscopic scale so 
that any appropriate mathematical descriptions of the limit N — oo must rely 
on measures. The computation of wave number and frequency is illustrated 
in Fig. 5.3, showing the oscillating values of the auxiliary observables Wy and 
Wg as well as their macroscopic mean values. 

In Fig. 5.4 we compare the microscopic distribution functions with their 
macroscopic predictions from modulation theory for six mesoscopic space-time 
windows at t = 0.4. For each of these windows we represent the distribution 
function of microscopic data by a density plot with high (Gray) and low 
(White) probability for finding a particle. Note that the support of every 
distribution functions is contained in closed curves, and that the distribution 
functions vary on the macroscopic scale. 

The black dots in Fig. 5.4 represent the macroscopic predictions: we project 
20 points Q; = QTW (i/20) of the curve (5.1), into the density plots. Figure 
5.4 reveals that the curve (3.13) coincides with the support of the microscopic 
distribution functions, and that the distance between Qi+ı and Q; is related 
to the gray level of the microscopic distribution functions. In conclusion, we 
can describe the microscopic oscillations within any window F by a periodic 
wave train. Moreover, we can conclude that the macroscopic evolution of the 
thermodynamic fields is indeed governed by the modulation equations (4.11), 
see the discussion in [DHO6]. 
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Fig. 5.3. Wave number and frequency: oscillating auxiliary variables and macro- 
scopic mean values. 
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Fig. 5.4. Distribution functions of the atomic data in three selected space-time 
windows at ¢ = 0.4; for the @coordinates see Figure 5.1. Each picture contains a 
density plot of the atomic data (White and Gray) together with an illustration of 
the macroscopic prediction (Black). 


6 The shock problem 


Since we expect a hyperbolic system describing the macroscopic limit, it is 
natural to investigate Riemann problems and interpret the results in terms of 
hyperbolic theory. A goal of this is to indicate selection principles for Riemann 
solvers that account for the macroscopic limit of atomic chains. 

We would naively expect to find rarefaction fans, shocks and possibly 
contact discontinuities, that are selected by characteristic curves and entropy 
conditions and whose velocities are determined by characteristic velocities and 
Rankine-Hugoniot conditions. 

It turns out that this picture is invalid when microscopic oscillations oc- 
cur, leading to modulated wave trains as mentioned in Sect. 4. Instead, we 
find a situation very similar to the zero dispersion limit of the KdV equa- 
tion mentioned in Sect. 1, where dispersive shock fans replace Lax-shock, 
and where velocities are not given by characteristic velocities of the limiting 
Burger’s equation, corresponding to the p-system in our case. Faced with a 
large number of publications on this matter, we restrict references here to 
[LLV93, LP05, El05] and the bibliographies therein. 

We focus on cold initial data, i.e. constant displacements and velocities 
with a single jump at some Q,, i.e. 


(r,v)(@,0) = (r_,v_), & < @,, and (r,v)(@,0) = (rz, 04), @> Ae. 


The macroscopic limit of the harmonic potential for such Riemann problems 
is cold and described by (4.13). It is therefore described by the corresponding 
p-system, which is a linear 1D wave equation, whose dynamics can be under- 
stood directly from the d’Alembert solution form, so there are only contact 
discontinuities. 

For general nonlinear potentials, there is numerical evidence that disper- 
sive shocks appear for initial data leading to Lax-shocks of the p-system, while 
rarefaction data leads to cold macroscopic limits described by the p-system. 


Wave Trains, Solitons and Modulation Theory in FPU Chains 493 





Distances, Time=0.0 Distances, Time=0.15 Distances, Time=0.3 
= +2.00 +2.00 


+1.50 +1.50 +1.50 





+1 00 : +1.00 
` Velocities, Time=0.0 


+1.00 





e Velocities, Time=0.3 k 





























+1.19 +1.19 š 
+0.00 —— lű +0.60 +0.59 
+0.00 +0.00 
o. 1. 0. 1. 0. 1. 





Fig. 6.1. Riemann problem with N = 4000 and (r) = (r — 1)? /2 — cos(2(r — 1))/4 
with one rarefaction wave and one dispersive shock: snapshots of atomic distances 
and velocities for t = 0.0, t = 0.15, and t = 0.3. 


In Fig. 6.1 we plot a typical situation for illustration, and sketch a dispersive 
shock fan in Fig. 6.3. We are particularly interested in the transition of the 
Whitham modulation at its front. 


Remark 6.1. For convex flux, i.e. P” > 0, the p-system can be solved uniquely 
in terms of at most two rarefaction or shock waves [Smo94]. For non-convex 
flux the situation is more complicated, and the entropy conditions for the p- 
system no longer agree, because eigenvalues are no longer genuinely nonlinear 
[KS97, LeF02]. A specific choice of a convex-concave potential for (1.1), nu- 
merically yields a macroscopically cold, strong shock, connecting states with 
equal characteristic velocities, and traveling with a different Rankine-Hugoniot 
velocity. In particular, it is not a contact discontinuity or Lax shock, but a 
(fast) undercompressive shock. Details on this phenomenon will be published 
elsewhere. 


The macroscopic dynamics in space-time for Riemann data appear to be 
self-similar, hence reducible to a macroscopic velocity variable c = a/t = 
a/t. More formally, we assume that the Young measure ju(c) arising in the 
macroscopic limit for (initially cold) Riemann problems at each c is either a 
point measure or supported on a closed curve, corresponding to a wave train, 
so that from the modulation ansatz (4.5) we obtain X(c, y) and analogously, 
from Sect. 4.3, an expression M*(c, p) for the vector of modulated distances 
and velocities. We use the phase variable y to parametrize the support of ju(c). 
In case u(c) is a point measure, we obtain a strong limit where X(c, y) = 0. 

A dispersive shock spans a range of speeds from the shock back velocity, 
Cp, to the shock front velocity, cp. To ease notation, we assume 0 < cp < Cf, 
and that the constant states to the left and right of the dispersive shock are 
(r_,v_) and (r4,v+) as sketched in Fig. 6.3. 

It is instructive to view the modulation of wave trains in a dispersive shock 
as the selection of a curve in the set of wave trains X(c, p) parametrized by 
Ch < c < cf in terms of the wave train parameters (r(c), k(c), w(c)). This curve 


494 W. Dreyer, M. Herrmann, J. Rademacher 






















































































Distances Sie ed i Temperature 
+2.00 distribution functions +0.07 
+0.83 
+1.50 +0.03 
| 
+1.00 14 8 > +0.00 le 
0. ` A A 5 0. 1. 
Mean Distance ie J 4h P Entropy 
+2.00 = ie d ‘ > 41.21 
+1.72 ==} +0.61 
-0.04 2 
+1.44 +0.95 distance +2.05 +0.00 |æ 
0 


1. 0. t: 





Fig. 6.2. Example for dispersive shocks. Left: snapshots of atomic distances and 
their local mean values. Center: superposition of several distribution functions within 
the shock; positions of the space-time windows are marked by vertical lines. Right: 
snapshots of temperature and entropy. 


bridges the energy jump between the constant states (r_,v_) and (r4, v+), 
and the wave trains become singular at cp and ce. Based on numerical evi- 
dence [HFM81] and results for the Toda chain [VDO91, Kam91], we assume 
that M®(cp) = (r_,v_) has zero amplitude, and the shock front M€ (cg) corre- 
sponds to a soliton with background state (r4, v+), where k(cg) = w(cr) = 0. 
Note that this is a singular limit of (3.4) and that Theorem 3.10 implies in- 
finite kinetic energy y(cr). We plot wave trains and fields within a dispersive 
shock in Fig. 6.2. 
More precisely, the shock front is assumed to be a homoclinic orbit 
H(s) := lim M° (cr) 
Cc Ce 

in the phase scaling y = ws with asymptotic state lim, +. H(s) = (r4, v+). 
We expect the convergence to the asymptotic state is exponential in s, thus 
Lp-norms of [H(s) — (r+, v+)] are finite. In terms of the wave train profile X 
and V = 0X we can write the second component Hz of H as 





H(s) := He(s) — v} = lim w(c)V(c,w(c)s) = lim Says 
C— CE C— Cf ds 
Both the vanishing amplitude at cp and sinusoidal oscillations, and the homo- 
clinic orbit at cp are natural codimension-1 singularity along a curve of wave 
trains viewed as periodic orbits. 

Assuming a soliton at the shock front means in particular that the mod- 
ulation system does not have a strong shock, which is challenging to confirm 
numerically as discussed below. Instead, we conjecture that at the shock front 
the entropy S jumps and (r,v,k,w) are continuous with unbounded deriva- 
tive. Heuristically, the excess energy at the jump in the initial data cannot 
be dissipated by the conservative system, but is transported dispersively via 
oscillations with two new degrees of freedom, frequency and wave number. 
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Properties at and near the soliton 


We predict the scaling of temperature and related quantities assuming the 
scaling in generic or conservative homoclinic bifurcations of ODEs [VF92], 
where the unfolding parameter, here c, is exponentially small in the period, 
here 1/w. We thus expect cp — c ~ e—*/” for some «x > 0, and so 


w(c) ~ k(c) ~ 1/log(ce — c), 


because Theorem 3.9 implies the same scaling in k. Indeed, this scaling could 
be confirmed for the case of Toda potential using the explicit solutions in 
[DKV95], and also appears in the formal derivations in [E105]. 

Temperature, entropy, entropy flux. The definition T = w? h V(y)?dy of 
the temperature of a wave train yields 


1 1/w(c) 
T() = f lole) Vle, p)? dy = w() J lole) Vle, w(c)s)]? ds 


and thus (assuming smoothness) the limiting temperature of the soliton 


1/w(c) oo 
lim T(c) = lim vo f [w(c) V(c, w(e)s)]? ds = f H(s)°ds lim w(c) = 0, 


CCE CCE CCE 


because the L? norm of H is finite. Then the scalings of temperature T, entropy 
S and entropy flux g, see (3.16), are given by 


T(c) ~ (log(cp—e))™" , gle) ~ S(c) = T(o)/w ~1, 


where we used cS’ = g’. We thus predict that the temperature is continuous 
for all c and decays to zero like 1/log. Entropy and entropy flux vanish in 
cold regions, but continuously approach a finite, non-zero value and jump to 
zero beyond the shock front. 

Since the temperature also decays towards the shock back, we expect a 
unimodal curve T(c) with a unique maximum, as is the case in e.g. a planar 
ODE where the interior of a homoclinic orbit is filled with periodic orbits and 
an elliptic equilibrium. 

However, these scalings and limiting values are difficult to confirm numer- 
ically, because the 1/log decay is hard to resolve, and the shock front could 
not be simulated in isolation from the rest of the modulation region due to 
boundary effects. 

Norm parameter y. On account of Theorem 3.10, the norm parameter y 
grows at least like 1/k, so that y(c) > —C log(cf — c), for a constant C > 0. 
This agrees with the prediction from the above entropy scaling, because 


S T 1 


y(e) = 3w Ee z ~ log(er — c). 
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Mean distance and velocity. Assuming that c unfolds the homoclinicity as 
a generic (or Hamiltonian) ODE, the flow time through a fixed small region 
near (r4,v+) grows logarithmically in c and thus for the average values we 
obtain the scaling 


r(c) = r4 — r1ı/log(cf — c) + h.o.t. v(c) = v4 — v1 / log(ce — c) + h.o.t., 


with some constants r1, v2, since the limiting values are those of the corre- 
sponding Riemann data. 

Note that the first equation in (4.4) implies —cr’ = v’ in the sense of 
distributions, where ’ = d/dc. Therefore, —cfrı = v1, and so 


ce = — 01/11 (6.1) 


replaces the Rankine-Hugoniot jump condition. 

Propagation speeds. The modulation equations yield five equations for the 
propagation speed of the shock front; four in term of leading order expansions 
such as —cr’ = v’ — —c = du/dr above, and one jump condition c[S] = |g]. 
Indeed, in numerical simulations of dispersive shocks all these velocities are 
close to that obtained from the slope of the shock front in space-time. 

The conservation of wave number implies —ck’ = w’ and thus throughout 
the dispersive shock we have —c = Cg := dw/dk, which is the group velocity 


and not the phase velocity Cph := —w/k of wave trains. Note that here, —c is 
the expected propagation velocity due to the choide of sign for w in (3.3) and 
(4.7). 


In particular, the shock front should move with the limiting group velocity, 
while the soliton speed naturally is the limiting phase velocity. However, in 
the solitary limit, phase and group velocity typically coincide, because for 
L =1/k we have the identity 











Cg = Oph — L te 
IA 
furcations in ODE [VF92]; the identity follows from eg = 





Recall that the phase velocities of wave trains were estimated 7 in (3.14) and 
rigorously imply that the soliton velocity is bounded (essentially) by p-system 
characteristics velocities c_, c_ of the left and right states r_, r+}. However, 
in numerical simulations, the shock front velocity cp never exhausted these 
bounds, but was strictly between c_ and c+. 

On the other hand, the shock velocity of the p-system is given by the 
Rankine-Hugoniot condition Gn = y (®' (r) — © (r+))/(r_ — r+), and in all 
cases (for P’ monotone) we numerically found the velocity ordering sketched 
in Fig. 6.3, that is, 

Cb < C+ < Crh < Cf < C_, 
where crn — cp ~ 5%. Characteristics point into the dispersive shock fan, and 


indeed, we seem to find dispersive shocks only if c— < cn < c+, see also 
Remark 6.1. 
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Fig. 6.3. Left: Sketch of a dispersive shock for the macroscopic limit of a shock 
problem in (1.1). Dashed line is the p-system Lax shock with speed crn, dotted the 
p-system characteristic velocities c+ of left and right states (r+, v+). Right: sketch 
of the r-modulation at some time t. > 0 with 1/log scaling at cg. 














Finally, we mention that the velocity c» of the shock back, where wave 
trains have small amplitude, numerically agrees with the prediction from har- 
monic modulation equations, i.e. cp = V P” (r_) sin(zk(cp)) /akp. 


Remarks and open problems 


The occurrence of dispersive shocks has only been proven rigorously for some 
completely integrable cases, in particular the Toda chain [VDO91, Kam91]. 
Unfortunately, the literature on this issue is not easily accessible to non- 
specialists, and we found it inconclusive concerning the rigorous justification 
of a hyperbolic system of Whitham modulation equations. In fact, neither the 
observation that the shock front is a soliton, nor the scaling at the shock front, 
nor the velocity of the shock back seem to be worked out. 

Similarly, to our knowledge, the selection mechanism for the soliton has not 
been formulated in terms of initial values for the Riemann problem (though 
the shock front velocity for the Toda shock problem can be computed ex- 
plicitly [VDO91]). An observation towards a selection principle could be that 
in numerical experiments for vanishing initial velocities, the dispersive shock 
exhausts precisely the range between the initial jump in the r-component. 
We also observe that the dispersive shock in (c, r, v)-space is a graph over the 
plane (0,r,v). In other words, the modulation parameter curve (r, v, k,w)(c) 
appears to be selected in such a way that wave train orbits Q are (nested) 
level sets of an unknown function. 

We hope that a study of the explicit solutions for the case of Toda poten- 
tial, and results for zero dispersion limits mentioned in Sect. 1, provide more 
insight into the general situation, in particular the prediction of dispersive 
shocks and the shock front velocity. 
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1 Introduction 


A phenomenon of fundamental importance in materials science is the late 
stage coarsening in a phase transformation in two-phase mixtures. Phase sepa- 
ration occurs for example during heat treatment, where the mixture is brought 
into a thermodynamically unstable state, in which two different phases are 
energetically favored. If the volume fraction of one phase is small, phase sep- 
aration, triggered by thermal fluctuations, results in a polydisperse mixture 
of spherical particles of the minority phase, immersed in a background phase, 
the “matrix”. Due to the large surface energy of the fine mixture the system 
is still far from equilibrium. In the last stage of the phase transformation the 
particles compete to reduce the total surface area by transferring mass from 
smaller to larger particles. As a consequence, large particles grow at the ex- 
pense of the smaller ones which eventually disappear. This particular form of 
coarsening is also known as Ostwald ripening. 

The process is well-described by a class of free boundary problems (cf. 
Sect. 2.1), which are however too complex to allow for predictions on the 
dynamics of large particle ensembles. A major advance was made in the late 
50’s by Lifshitz and Slyozov [LS61] and independently by Wagner [Wag61], 
in their nowadays classical LSW-theory for Ostwald ripening. They derived a 
mean-field model in the regime of vanishing volume fraction of particle phase. 
Within this mean-field theory they predicted universal self-similar large time 
asymptotics, which enabled them to make quantitative predictions on the 
coarsening statistics. In particular, this analysis implies universal growth rates 
for the mean particle size, which serves as a measure for the coarsening rate, 
and a universal particle size distribution. 

More than 25 years after the theory was developed, a lively discus- 
sion started in the physics and metallurgical community [Bro89, Bro90a, 
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Bro90b, HHR92, HH*89, Hoy90] whether universal large-time behavior within 
the LSW model is indeed strictly valid or whether the asymptotics de- 
pend on the initial data. This issue has by now been resolved by care- 
ful numerical simulations and also by a rigorous mathematical analysis 
[GMS98, Mee99, NP99, NPO1]. In fact, it turns out that the LSW model 
alone does not display universal self-similar large-time behavior if no addi- 
tional effects are taken into account. 

While the latter disadvantage of the LSW theory was not generally ac- 
cepted, the limited regime of validity of the LSW model due to the mean-field 
assumption, which neglects local interactions between particles, was recog- 
nized early as a significant shortcoming. Several approaches were taken to de- 
velop theories which take into account finite volume fraction of the new phase 
(see e.g. [AV94, BW79, HV88, KET86, Mar87, MR84, TE93, TK84, TKE87, 
VG84, YE*93]). A key question in this topic is whether and how correlations 
between particles develop during the evolution and how they influence the 
coarsening rate. 

In this article we review recent progress in the mathematical analysis of 
fundamental issues in models of Ostwald ripening. We will describe the regime 
of validity of zero-order mean-field models and their failure in predictive power 
due to weak selection of self-similar asymptotic states. To overcome these 
drawbacks we consider extensions of the mean-field model in the sense of a 
perturbative theory which accounts for finite volume fraction of particles. A 
main part of this article will be devoted to a review of recent results deter- 
mining the scaling and precise form of the corrective terms. 

First we will clarify why the first order corrections have different relative 
size in finite and infinite systems respectively. Second, as a main part of this 
review, we describe the main ideas and results of a perturbative theory which 
takes correlations between particles into account. Under the assumption that 
particles are weakly correlated we derive a closed system of equations for 
the one- and two-particle distribution function. By our method we recover 
a theory proposed by Marder [Mar87] in the applied literature, but under a 
more natural closure hypothesis. However, it appears, that the assumption 
of small correlations is destroyed for the largest particles by the evolution. 
Here fluctuations are relevant and we present the results of a corresponding 
recent theory [NV06a] which takes this effect into account. We also briefly 
discuss the possible role of collisions between particles. This effect seems on 
first glance negligible, but on closer inspection it turns out, that it might have 
an important effect on the long-time behavior. 

The problem of Ostwald ripening is not only a basic phenomenon in the 
aging of materials. The structure of the model also poses interesting and chal- 
lenging mathematical problems and can be viewed as a paradigm of a many- 
particle system with long range interactions through a slowly decaying field. 
Similar problems and questions arise for example in sedimentation problems. 

We limit our discussion to the issues related to the mathematical analysis 
of basic models for Ostwald ripening. Further aspects, which are relevant for 
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applications, such as the role of mechanical stresses, but seem presently much 
less accessible by a mathematical analysis, are discussed for example in the 
review papers [FPL99, Voo85, Voo92] and the references therein. 


2 Models 


2.1 The Mullins-Sekerka evolution 


The Cahn-Hilliard model [Peg89] describes nucleation, growth and the early 
stages of coarsening. We are interested in the case where an initially uniform 
mixture is unstable which means that the free energy E favors two phases. 
Each phase is characterized by a specific value of the concentration and they 
are separated by an interfacial layer of characteristic width and of minimal 
area. The dynamics, driven by the negative gradient —Vu of the chemical 
potential u and limited by the diffusion of each component, reduces the free 
energy while the spatial average of the relative concentration is preserved. 

Coarsening in the Cahn-Hilliard model is well-described by the Mullins- 
Sekerka sharp-interface model [ABC94, Peg89]. It describes the position of the 
interfacial layer by the boundary OD of a region D C 2 C R. It is based on 
the assumption that diffusion in the bulk takes place on a much faster time 
scale than evolution of the interface. Thus, the potential relaxes at each time 
instantaneously to equilibrium. The model is given in dimensionless variables 
by 


Au =0_ in the bulk 2Q\OD, (2.1) 
w= at the interface 0D, (2.2) 
V =([Vu-n] at the interface OD. (2.3) 


Here « denotes the mean curvature of the interface, n its normal, [Vu . n|] 
the jump of the normal component of the gradient Vu - n across the interface 
and V its normal velocity. Equation (2.2) is the well known Gibbs-Thomson 
law, which accounts for surface tension and (2.3) is the Stefan condition, which 
reflects local conservation of energy. The main global features of this interfacial 
motion are that it reduces the area of 0D while it preserves the volume of D, if 
(2.1), (2.2) are supplemented with periodic or Neumann boundary conditions 
on ðN. 

Some of the upcoming analysis is motivated by the fact that the Mullins- 
Sekerka evolution has an interpretation as a gradient flow. It is the gradient 
flow of the surface energy on the manifold of phase distributions with constant 
volume where the metric tensor is given by the H7! norm in the bulk. 


2.2 Small volume fraction 


In the following we consider the regime where the volume fraction covered 
by D is very small. Once the interfacial regime emerges, D consists of many 
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disconnected “particles”, which quickly become approximately radially sym- 
metric, and whose centers essentially do not move. Then it is legitimate to 
replace (2.3) by taking the average over each particle. The particles are then 
characterized by their immovable center X; and their radii R;(t) and (2.3) 


reduces to 1 


a Vu- n] d5, 2.4 
4nR? Jap, pou ee) 
where P; := B(X;, Ri(t)), and (2.2) gives 
1 


It has been established by a rigorous analysis in [AF99, AF03] that the 
Mullins-Sekerka problem is in the regime of small volume fraction well ap- 
proximated by (2.1), (2.5) and (2.4). 

It is not difficult to establish that the system (2.1), (2.4) and (2.5) is locally 
well-posed if initially particles do not overlap. Such a local in time solution can 
be extended up to a time when a particle vanishes or when two particles touch. 
The first event is not critical; we just extend the evolution up to the time when 
a particle disappears, remove the particle and continue with the remaining 
ones. In this way we obtain a continuous in time, piecewise smooth solution. 
The second phenomenon, that particles touch, is problematic. In particular 
so, as the simplification to spherical shape is not a good approximation if 
particles are close. In principle one has to go back to the full Mullins-Sekerka 
evolution (2.1)-(2.3), or one constructs an ad-hoc regularization by merging 
the two particles into one with the same volume. Since we are interested in 
the dynamics of a large set of particles with small volume fraction, we expect 
that such an event is rare if it occurs at all and does not have an influence 
on the global behavior of the system. The latter is certainly true to leading 
order, but we will in particular also be interested in higher order terms, where 
the effect of collisions might have to be taken into account. We will address 
this issue in the end of this article. 

During the subsequent evolution, the large particles grow at the expense 
of small particles, which eventually vanish. The state of the system is now 
completely described by the radii {R;}; of the particles and it is tempting to 
try to identify the evolution law for {R;};. Lifshitz, Slyozov and Wagner [LS61, 
Wag61] have done this by formal arguments and found that this evolution 
reduces to leading order solely to an evolution of the distribution of the 
{R;};. We now investigate in more detail the formal argument of LSW and 
discuss its crucial assumptions. 


2.3 Leading order approximation 


In the following we denote by 
e ¢ the volume fraction of the particles, 
e p the density of particles. 
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The typical distance (d) between neighboring particles is given by p713, 
whereas the typical particle radius (R) is given by 
4r 
3 (RY =pġ. 
We are always interested in the regime of small volume fraction, i.e. 
e<l (2.6) 
which in particular implies that 
(R) < (d). (2.7) 


According to (2.1), the chemical potential u is a harmonic function outside 
the particles. Thanks to the separation of particles expressed in (2.7) there 
exists a function ù such that 


e wis “slowly varying” in the sense that the length scale of variations of ù 
is much larger than (d), 
e uu at distances from the particles which are much larger than (R). 


This function ù is called the mean field. 

Now recall that u is harmonic outside the particles and satisfies the Gibbs- 
Thomson law (2.5) on the boundary of the particles. Since u ~ ù away from 
the particles and since ŭ is approximately constant for |x — X;| < (d), we find 
that 





= z for R; > |x — X;| 
a [l (1 — Ri ü) pox + 7 for Ri < |x — X;| « (d) 
and thus according to (2.4) 
Ri = [Vu-n] & = (Riū— 1). (2.8) 


The mean-field ù has still to be determined. It turns out that the length scale 
over which & is varying is intrinsically determined by the screening effect 
between particles. This effect is crucial in the theory of Ostwald ripening and 
we will address is separately in the next section. 


2.4 The screening length 


The screening length € describes the effective range of particle interactions. 
It is analogous to the classical Debye-Hiickel screening length in electrostatics 
and can heuristically be derived as follows. 

Consider a point charge at Xo € R surrounded by conducting balls 
B(R;, X;) which are uniformly distributed according to a number density 
p and have volume fraction ¢ < 1. The point charge at Xo creates an electric 
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field and a corresponding potential G and thus induces a negative charge on 
OB(R;, X;). This induced charge roughly equals —47 R;G(X;), where 47 R; is 
the capacity of a single ball in R3. In a dilute system capacity is approximately 
additive which implies that the total negative charge is approximately given 
by —47(R)pG. Hence the effective electric potential satisfies 


—AG = bx, —47(R)pG in R3, 


and thus 1 ; 
eee a as 2 
cO o (2.9) 
where j 
E = —— (2.10) 
4n(R)p 


is the screening length. 

Formula (2.9) shows that the presence of the balls has the effect that the 
effective range of the electric potential is limited to €, whereas the electric 
potential in a system without balls is just TEKI and decays only slowly. 


2.5 The classical LSW model 


The classical theory by Lifshitz, Slyozov and Wagner (LSW) [LS61, Wag61] is 
based on (2.8) and the assumption that the interaction range between particles 
is infinite. Hence, the mean-field ŭ is constant in space and for each time 
determined by the constraint that the volume fraction of particles is conserved. 
This leads to the set of equations 


: 1 
Ri = Re (Rit — 1), 


sr 
HO) = R 


where the sum extends over the particles with positive volume. Notice that 
the critical radius is given by 1/t, in other words, the critical radius is just 
the mean radius in the systems. Particles larger than the critical radius grow, 
smaller ones shrink. However, the critical radius typically increases over time, 
so that more and more particles start to shrink and finally disappear. 

The system (2.11) now translates without further approximation into the 
following evolution law for the one-particle number density fı = fı (R, t): 


(2.11) 


Bi fick One (pa (Ralt) -1)ft) =0 (2.12) 


with 
_ ffilR,t)dR 
— J RA(R,t) dR’ 


u(t) (2.13) 
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It is interesting to note that the LSW-model has an interpretation as a gradi- 
ent flow, a structure which is inherited from the Mullins-Sekerka model (see 
also [FSF03] for a direct derivation from thermodynamic extremal princi- 
ples). This structure is most easily described for a finite collection of particles 
R := { Ri };, which corresponds to a discrete size distribution. The correspond- 
ing manifold of a set of particles with fixed volume is characterized by 


M := {R = {Ri}; | SOR? = const. } (2.14) 
and the tangent space by 


TRM i= {V = {Vikil D RV: =O}. 


The energy is given by the surface energy 
ER)=4X_ R?. (2.15) 


On M we introduce a Riemannian metric via the metric tensor g given by 


gR(V, W) = X RVW; for all V, W € TRM. (2.16) 


For a system to be a gradient flow, it means that the direction of the evolution 
at each time, here represented by R = {R;}, is in the direction of steepest 
descent on the energy landscape. This implies that for every time the equation 


grR(R, V) = -dE(R)V forall V € TrM 
must be satisfied. In view of (2.15) and (2.16) this is equivalent to 


1 AÀ 
R? Ri 


E 
for a Lagrange-multiplier A, which is defined by volume conservation. One 
easily finds that \ = 57, 1/50, Ri, i.e. A = ū as defined in (2.11). 

A recent proof of well-posedness of the LSW model and variants of it 
[NP05] makes significant use of the corresponding dissipation identity (see 
also [CG00, Lau01, Lau02, NP00] for further results on well-posedness). 

Of main interest is whether the LSW model exhibits universal dynamic 
scaling. In fact, the model has the scale invariance R ~ t!/3. Therefore, it is not 
surprising that it has one smooth self-similar solution. Folklore suggests that 
this self-similar solution characterizes the long-time behavior of all solutions, 
thereby capturing the statistical self-similarity of Ostwald ripening. Based on 
this, LSW have predicted the following growth law for the expected particle 
radius (R): 


508 B. Niethammer, F. Otto, J. Velazquez 


(R) ~ (<1) (2.17) 


and a specific unique shape for the rescaled size distribution. Unfortunately, 
the classical LSW-theory has two serious shortcomings which will be discussed 
in the next section. 


3 Validity and failure of classical mean-field models 


3.1 Long-time dynamics and weak selection 


In fact, (2.12) has not only one but a one-parameter family of self-similar 
solutions. All members of this family have compact support and can be char- 
acterized by their behavior at the end of the support, that is their behav- 
ior for large particles. More precisely, for any a € (—1, 00] there exists a 
self-similar solution behaving like a polynomial with power a (resp. like an 
exponential for a = oo) at the end of its support. While Wagner [Wag61] 
seems not to recognize the existence of self-similar solutions for a < oo, Lif- 
shitz and Slyozov [LS61] find all self-similar solutions, but argue that only 
the one with a = oo is stable. Their stability argument takes so called ”en- 
counters” between particles into account, which are not represented in the 
LSW model (2.12). However, it seems that the latter argument has not been 
taken into account in the controversial discussion in the applied literature 
[Bro89, Bro90a, Bro90b, HHR92, HH*89, Hoy90] and the question remained, 
whether or not, universal self-similar behavior in the LSW model (2.12). Only 
several years later it has been recognized in [GMS98, MS96] and shown by a 
rigorous analysis in [NP99, NP01] that the long-time behavior of the LSW- 
equation is not at all universal but depends very sensitively on the data. 

Let us describe the main features of the long-time behavior in more de- 
tail: the first mathematically rigorous results were obtained in [NP99] (and 
in [NPO1] for a related problem; see also [CG04] for simulations) for data 
with compact support. The main result gives a necessary condition for con- 
vergence towards any self-similar solution with a € (—1, 00). It turns out that 
the data have to be regularly varying at the end of their support with the 
same power a. This result in particular also implies that for a large class 
of data - the ones which are not regularly varying - no self-similar behavior 
occurs. The results of [NP99] were improved in [NVO6b], where it is shown 
that the condition of regular variation is also sufficient for convergence for 
data which are small perturbations of the self-similar solution. Convergence 
for general regularly varying data has been established for the case that a 
is not too large. Furthermore, a kind of stability result is provided, which is 
perhaps the most interesting from the point of view of applications. It states 
— roughly speaking — that for data which are not regularly varying, but whose 
variation is bounded, the system still coarsens with the expected rates. In 
contrast to [KO02], where weak upper bounds for coarsening rates within the 
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Cahn-Hilliard model are established (see also [CNO06] for the dependence of 
coarsening rates on volume fraction), the analysis in [NVO6b] concerns the 
simpler mean-field model but provides much stronger results, namely point- 
wise upper and lower bounds. 

All the results [NP99, NP01, NVO6b] are for self-similar solutions with 
finite a. In [NV06c] the case a = oo, that is the case of the LSW solution, is 
considered. A necessary and locally sufficient criterion on the data for conver- 
gence to this self-similar solution is established. In a certain sense the domain 
of attraction is much larger, since the corresponding condition is much less 
stringent than regular variation. We also refer to [Car06, CP98], where anal- 
ogous results for a simplified LSW model have been obtained. 

While the case œ = oo is already more involved than for finite a, the 
analysis of the long-time behavior of the LSW-model for data with noncom- 
pact support is again much harder. Partial results have been obtained in 
[Vel06a, Vel06b], which show that also in this case the long-time behavior is 
extremely sensitive to the initial data. 

Let us also mention here an ad-hoc approach to overcome the unphysical 
“weak selection problem” in the LSW model. was added in (2.12) In [Mee99] 
and [Vel98] a second-order term was added to (2.12) accounting for fluctua- 
tions from the nucleation regime. The main role of this term is to create an 
infinite tail for the radius distribution and it is argued that then the only pos- 
sible limit is the LSW-solution. However, this regularization is derived on an 
ad-hoc basis and the physical relevance seems not clear at this stage. In Sect. 5 
below, we will present results from a recent analysis [NV06a], where a second 
order term is derived from the Mullins-Sekerka evolution, which accounts for 
pair-interaction between particles. 


3.2 Narrow range of validity of the classical LSW-theory 


The second problem within the classical LSW-theory is that all self-similar 
profiles and the corresponding coarsening rates do not agree with experiments. 
It is common belief that this discrepancy is due to the finiteness of the volume 
fraction of particles. As mentioned previously, one can find a large variety of 
approaches in the physics literature to predict the first-order correction to the 
zero order theory (e.g. [AV94, BW79, HV88, KET86, Mar87, MR84, TE93, 
TK84, TKE87, VG84, YE*93]). One main goal of this paper is to summarize 
how to access the first-order correction analytically. 

Before we start let us briefly summarize the by now fairly complete results 
on the rigorous derivation of zero-order models. The first rigorous derivation of 
the classical LSW-model in the case that the total system size is much smaller 
than the screening length was obtained in [Nie99, Nie00]. In this regime it has 
been shown in [Vel00] that stochastic effects through noise in the data cannot 
regularize the weak selection problem in the LSW theory over the relevant 
time scales (times over which still a sufficiently large number of particles is 
present). In the case that the system is of the order of the screening length or 
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larger one obtains in fact an inhomogeneous extension of the LSW model. In 
this regime the evolution for {R;}; reduces only to an evolution of the joint 
distribution of particle radii and particle centers fı(R, X,t). We have again 
(2.8), but (2.13) generalizes to 


—Axt + 4r (afas dR — [ nar) = 0. (3.1) 


The kinetic equation (2.12) now has X as a parameter. Again, this equa- 
tion clearly demonstrates effective screening in an arrangement of charged 
particles, since the Green’s function of the operator —Au + pu = 0, where 
u = 4r f Rf dR, decays exponentially fast over length scales of the order of 


1/2 
£. Since in view of (2.10) we have € ~ (4) ~ (d)@—'/6 we have in the 


regime of small volume fraction that € >> (d) and consequently the interaction 
range of one particle still contains a large number of particles. 

In the case that the system is of the order of the screening length equa- 
tion (3.1) coupled with (2.12) has been rigorously derived in [NOO1b] (see 
also [NOOla] for the more difficult two-dimensional case), a construction of 
correctors and deterministic error estimates is provided in [GNO2]. 

The work [NV04a, NV04b] handles the most relevant situation where the 
system size is much larger than the screening length. This implies that when 
rescaling the system with respect to the natural length scale, the screening 
length, one obtains a homogenization problem in an unbounded domain. One 
important step in the analysis is well-posedness for the limit problem in un- 
bounded domains [NV04c] and the result [NV06d] which establishes that the 
fundamental solution of the microscopic problem decays exponentially w.r.t. 
the screening length. This allows to ” localize” the homogenization procedure 
in [NV04a, NV04b]. Special care is taken to have sufficiently weak assumptions 
on the particle distribution which include an initially random distribution of 
particles. First, it is shown that the evolution preserves such a statistically uni- 
form distribution over time, which allows a proper definition of “the” screening 
length. Second, it is established that only a very small fraction of particles 
can overlap and that this does not affect the macroscopic evolution law for 
the remaining particles. Thus, this result also rules out corrections on the zero 
order level due to a stochastic nature of the data. 





A First-order corrections 


We have seen that the LSW mean-field model is in a certain sense degenerate 
since it allows for a family of self-similar solutions, where none of them is in 
any sense distinguished. 

Experiments on Ostwald ripening on the other hand only partly confirm 
the predictions of the LSW-theory: The growth exponent in (2.17) is confirmed 
but the rate is considerably larger, see [AS*99] for a recent experiment. The 
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general belief is that the deviation is due to the finiteness of ¢. The zero 
order theory treats the spatial arrangement as if the particles were infinitely 
far away. Hence it tends to overestimate the distance over which particles 
have to diffuse and thus the constant in (2.17) underestimates the rate of 
coarsening. The magnitude of the expected correction will depend on the 
spatial correlation between growing and shrinking particles (sinks and sources 
for the diffusion field w). 


4.1 Finite systems 


If one tries to analytically access the scaling of the lowest order correction 
to LSW, one finds that the analysis for large but finite clusters is quite dif- 
ferent from that for infinite clusters and yields a different prediction of the 
scaling. This has lead to a controversy about the relevance of either analysis. 
More precisely, there seems to be a contradiction between the analysis for in- 
finite systems, which predicts the scaling ¢!/? of the correction term and the 
analysis for finite systems, which predicts the scaling ¢!/3. Based on “snap- 
shot” numerical simulations, Fradkov, Glicksman & Marsh [FGM96], see also 
[MG*98], predict a cross-over from ¢!/3 to $!/? with increasing system size. 
We will now explain the results as well as the snapshot perspective in more 
detail. 


4.1.1 Heuristic arguments 


We will give a heuristic argument for a cross-over of the order of the correction 
to the LSW theory in finite systems. This is most easily explained by the 
monopole approximation of (2.1)-(2.2), which seems to go back to Weins and 
Cahn [WC73]: In this approximation, the growth rates {B;}; of the particle 
volumes, that is —B; := 4[4¢ R3] = 47 R? ti, are obtained by solving the 
linear system of equations 





ee Spee Z1 
Ri R; Z di 


1 Bi Bj 
+ J (4.1) 
J 


and 


XB: = 0, (4.2) 


where dj; := |X; — X;| is the distance between particle centers. Observe that 
ù can be interpreted as a Lagrange multiplier for (4.2). It has been argued 
that the error coming from the monopole approximation is of higher order in 
@ than the first-order correction to the LSW theory. Indeed, the error is of 
order ¢?/3 as can be deduced e.g. from equation (2.42) of [AV94]. 

We consider a periodic setting with n particles in the box 
Qn = (Gees eae ater Then, solving the monopole approximation (4.1) 
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means to invert the matrix Id — g, where g = gij = — 7+. This is possible if 
dij 

g is small enough in an appropriate sense. If we take for instance the matrix 

norm corresponding to max; |B;| we find 


lel = sup lol ~ R(E. 


. 2 
In view of (2.10) this gives ||g|| ~ (se e) . Thus, g is invertible 


if the system size is much smaller than the screening length. In this case, the 
system (4.1) can be solved by an asymptotic expansion using the Neumann 
series, which leads to 


By = (1— Rit) +S) gig(t Ra) + S00 gizgyn(1 — Ret) + 
j j k 
and u has to be determined such that 5°, B; = 0 holds to the desired order. 


Since the entries of the matrix scale as ¢'/°, the ratio between (R) and (d), 
this yields an expansion for B; in the parameter $!/3. 


4.1.2 Numerical simulations 


The starting point in the numerical simulations of Fradkov, Glicksman and 
Marsh [FGM96, MG*98] is a fixed joint distribution of particle centers {X;}; 
in the sphere of volume a and particle radii {R;}; with the following properties 


e {X;}; has number density p in the sphere of volume 2 

e R; is distributed according to the self-similar LSW Ais ribution with mean 
volume ¢. 

e R; and Rj are independent for i Æ j. 

e {X;}; and {Ri}; are independent. 


The parameters to be varied are the average number n > 1 and the volume 
fraction ¢ < 1 of particles within 2n. 


“Snapshot” analysis means the study of the joint distribution of 
{X;, Ri, Bi}i, where the {Bi}; are determined according to (4.1). In partic- 
ular, one is interested in the deviation of {B;}; from the LSW growth rates 
{BSW}, implicitly given by the obvious truncation of (4.1) 


> saw ypa and XO BEW = 0 (4.3) 
_ Bt 


i i 


In [FGM96, MG*98], the deviation is measured by considering the rate of 


change of particle radius Li E -zr B;. It is found by numerical simula- 


tions for n € [100, 3162] [FGM96, MG*98] that the quantity 
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(ee (Bi — BES) 
(rep BE) 

shows a cross-over between ¢!/3 and ¢!/? when the sphere becomes larger 
than the screening length. In view of (2.10) this means when 


nw gV2, (4.4) 


4.1.3 Rigorous results 


To our knowledge, the numerically observed cross-over has first been unam- 
biguously reproduced by analytical tools in [HNOO5b]. This is done under 
assumptions used in the numerical simulation [FGM96], that is, the monopole 
approximation and the snapshot perspective. 

We have argued that the cross-over in the correction term occurs when the 
system size becomes of the order of the screening length, in other words, when 
(4.4) holds. We call in the following particle systems “subcritical systems” or 
“supercritical systems” if they are smaller or larger than the screening length, 
respectively. 

Heuristic arguments and the numerical simulations in [FGM96, MG*98] 
suggest that the correction in a supercritical system should scale as: 


(EHF (Bi — BESW)) 


~ 1/2 —1/2 

pm BLW @ for n > ọ . (4.5) 
In view of (2.10), the quantity ¢'/? is just the ratio between the relevant 
length scales (R) and €. Furthermore, we gave some heuristic arguments that 
the correction term for subcritical systems should scale as ¢!/3. Therefore, 
both scalings coincide at the cross-over n ~ ¢~!/? when the correction term 
in a subcritical system scales as 


(2D, pe (Bi — BES) 
Pye Bem) 

Instead of considering the expected relative deviation in the rate of change 

of the mean radius we will investigate the relative deviation in the rate of 


change of energy. This idea is motivated by the gradient flow structure of the 
Mullins-Sekerka evolution. More precisely we consider 


w nig for n &« gV?, (4.6) 


LW _ È 
ESWS (4.7) 
(Eee) 


where E is the interfacial energy of the particles, i.e. 


1 2 
an ; 4 
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and its rate of change is 





. 1 Bi 
Sey 
n UR 
while eae 
ELSW Saa i 
n R 
with 
Bey =e Rae’. ut sSW = Dil . 
2; Ri 


Since the energy is decreasing, É is always negative. Likewise E'S is always 

negative, but we expect the difference in (4.7) to be positive for most realiza- 

tions, since the LSW theory should underestimate the coarsening rate. The 

reason for measuring the correction in terms of (4.7) is that because of the 

gradient flow structure of the evolution, the quantity (4.7) can be expressed 

variationally (see (4.9)). 

The main result in [HNOO5b] is that with high probability 

ELSW È p 1/3 gi/3 for n< p12 T 
(BESW)| es gil? for n> poi? : (4.8) 


Observe that this result is somewhat stronger than (4.5) and (4.6) in the sense 
that we make a qualitative statement about the entire distribution, not just 
its expected value. 

The result is derived under some regularity assumptions on the distribution 
of the particle centers, which are satisfied with high probability for indepen- 
dently distributed centers (see Sect. 2.4 of [HNO05b] for details). Furthermore 
it is assumed that the radii are independently and identically distributed with 
respect to a probability density with compact support. In the following (-) de- 
notes the expected value with respect to the joint probability measure P of 
the variables {R;};. 

Henceforth we say that a stochastic variable F satisfies F < C with high 
probability if for any £ > 0 there exists C(e) such that P({F < C}°) < e. 


Theorem 4.1. (/HNO05b/, Thm. 2.1) (The sub-critical regime) 
Ifn <7? we have with high probability that 


: SLSW 
E-E SS op Age, 
feces 
Furthermore p oe 
(E= Be) geda ayi 


eam “o 
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Theorem 4.2. (/HNO05b/, Thm. 2.2) (The super-critical regime) 
Ifn <7? and < ġo we have with high probability that 
G mLSW 
Ogle E 
(ErS)| C 
The proof of these estimates relies on the fact that the underlying evo- 
lution has the structure of a gradient flow and thus (4.7) has a variational 
formulation. To see this, note that a solution of (4.1) can be characterized as 
a solution of 





For the solution B; we have 
1 1 Bi 1 Bi 1. 
— — — — = = -E 
ne ite 2 3 a n 2 Ri n£ 2Ri 2 


Therefore we can write the deviation in the rate of change of energy from the 
LSW result in the form: 


a 1 l= 
E- ELW = min GX: — pB? 
{Bi} Z; Bi=0 > Ri 


Dn n o 


i jAt 

















We recall that BPSW = 1 — a, where R = +>, Ri, and use >, B; = 0 to 
find 


Faf) “Sak Rag) 
Zm l-E) 
-Drt D Dc 


Thus we can express the deviation of the rate of change of energy in the 
compact form: 


È- ÈISW = Toe B Yg (B-B) + IDZA 1 


{Bi}; Bi ; t i jżi 





(4.9) 
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The variational formulation has the advantage that one can obtain a useful 
upper bound by finding a suitable trial field Bi. The construction of a proper 
trial field in the super-critical case is guided by the intuition that due to the 
screening effect the system separates into independent subsystems of the size 
of the screening length. Indeed, the LSW construction in subsystems of the 
size of order of the screening length provides a suitable upper bound. The 
mathematically most interesting part is the lower bound, which is established 
in Prop. 3.3 of [HNO05b] by the use of Fourier analysis and a suitable splitting 
of the interaction field in near- and far-field respectively. 

Remark: Notice that in the sub-critical regimes we only succeed to derive a 
lower bound, whereas we obtain an upper bound only for the expected value. 
It is not surprising, that subcritical systems have less good self-averaging 
properties than supercritical systems and, in fact, a recent rigorous result 
[CH*06] shows, that for any M > 0 there is a finite probability pm > 0 such 
that (E — E5W)/|(EESW)| > M. 


4.2 Infinite systems 


We are now interested in the precise form of the expected growth rate of 
a particle up to order ¢'/? in large (or infinite) systems and under natural 
assumptions on the statistics of the particles. 

We start by reviewing some results from the physics literature. Marqusee 
and Ross [MR84] derive the evolution of the one-point statistics under the as- 
sumption of independently and identically distributed particles. They do this 
by manipulating the non-convergent series in the monopole approximation, 
which they interpret as a multiple scattering series. 

However, it is obvious that the assumption that {(R;, X;)}; are statistically 
independent is not preserved by the evolution: A medium sized particle in the 
neighborhood of a large particle will shrink faster than in an average environ- 
ment. Hence a large particle eventually influences the statistics of {(R;, Xi)}; 
within the screening length. This in turn will influence the evolution of that 
large particle. 

Marder [Mar87] takes this effect into account and derives the evolution of 
the two-point statistics up to an error o(¢'/?). Starting from the monopole 
approximation he generates a hierarchy of equations for the expectation value 
of Bı conditioned on the position and radius (R1, X1),--- , (Rk, Xk) of a finite 
number of particles. He truncates the hierarchy on the level of two-particle 
statistics by a closure hypothesis. 

In the second part of the paper, Marder performs an analysis of the evolu- 
tion for the two-point statistics. He assumes that particles are initially inde- 
pendently distributed and then linearizes around the Marqusee-Ross theory. 
The resulting equations are solved numerically. As an effect of correlations, 
Marder’s theory predicts a significantly stronger broadening of the self-similar 
particle size distribution than the Marqusee-Ross theory. 


Correlations, Fluctuations and Collisions in Ostwald Ripening 517 


Yet a different calculus has been developed in Tokuhama, Enomoto and 
Kawasaki [KET86, TE93, TK84, TKE87]. They also start from the monopole 
approximation (4.1) but allow for arbitrary correlations. By splitting the ma- 
trix in the monopole approximation into a deterministic and a fluctuating 
part they obtain a first-order correction in ¢!/? which resemble the theory 
of Marqusee-Ross and in addition contains a term coming from correlations 
which is however not explicit. 


4.3 A perturbative theory to capture correlations 


A new method to identify the conditional expectations of particle growth 
rates has recently been proposed in [HNO05a]. We will now summarize the 
method and results of this work, which recovers Marder’s theory but under a 
more natural closure assumption. As a byproduct, one also obtains a simple 
derivation of Marqusee-Ross’s evolution for the one-particle statistics. 

The main step is to rederive the expected value of the growth rate of par- 
ticle 1 conditioned on particles 1 and 2, denoted by (B,|(Ri, X1), (R2, X2)). 
Motivated by a cluster expansion, we assume that the joint probability distri- 
bution of {(R;, X;)}i>1 has a special form which only depends on the one- and 
two-particle statistics. To express (B,|(Ri,X1), (R2, X2)) in terms of these 
one- and two-particle statistics we employ a method which allows us to sepa- 
rate screening and correlation effects. The idea is to relate the system with all 
particles {(R;,.X;)}is1 to the system {(R;, Xi) }i>441 where k particles have 
been removed. This amounts to one step in the Schwarz alternating method, a 
deterministic argument which captures the screening effects. If {(Ri, X;)}is1 
are independent, {(R:,X;)}io1 and {(Ri, Xi) }i>k are statistically equivalent 
in an infinite system. Hence expectations conditioned on the removed particles 
{(Ri, Xi) }i<z can be replaced by unconditioned expectations. This allows to 
derive closed equations for conditional expectations. 


4.3.1 Statistical setup 


As before we consider a periodic setting with n particles in the box Rn, but 
think now of the limit n 7 oo. In particular we always assume n?¢ > 1, ie. 
the box size is much larger than the screening length. 

We make the following assumptions on the statistics of our system. The 
distribution of {(R;, Xi)}i>1 is defined by a probability distribution 


Pn(Ri, X1,..., Rn, Xn) dRı Pb X,...dRy Ë Xn =: pn(1,...,n) d(1)...d(n). 


It is natural to assume that the distribution is invariant under particle ex- 
change, that is 
pr(o(1),...,0()) = pr(l,..., 7) (4.10) 


for all permutations g, and invariant under translation, that is 
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Pr( Ri, X1 = T,.. ss Rn, Xn = L) = pn(Ri,X1, sa »,Rn, Xn) (4.11) 


for all x € R. The probability distribution of the k-particle statistics 
{(Ri, Xi) }i<i<e is then given by 


Pellik) = f palon) dk +1)...d(n). 


Conditional expectations of a random variable v = v(1,...,n) are given by 


Won bk) = foao E alet 1)... a(n). 


In the limit n — oo it is more convenient to work with number densities. The 
one-particle density is given by 

2 4.10 
N(R, X) = (XD 4(R— R,)6(X — X:)) WS” np (R, X). (4.12) 


i=l 


(4.11) 
A(R) | 
The two-particle density is given by 


(4.11) 


fo(R, R, X — Š) fo(R, X, R, Š) 


= y 3 6(R — R,)6(X — X;)6(R — R,)5(X — X;)) 

i=1 jHi 
CI AN RRO XO = R2)ô(X — X2)) 
= n(n — 1)po(R, X, R, X) 


and so on. The volume fraction ¢ is given by the one-particle density via 


le JEER dR, 


the number density by 


= f spar 
and the capacity density, which defines the screening length, via 
1 
aa [Ar RAR) AR. (4.13) 


We are ultimately interested in the expected values of the growth rates (B1 |1), 
(Bı |1,2), (Bı |1,2,3) etc., which appear in the evolution equations for the 
one-, two- and three-particle statistics. Indeed, we have for fı = fı(Rı,t) and 
f = fo(Ri, Xi, Ra, Xa, t) that 


Of o- 
dh o 5 (4.14) 


oh as ena ic 
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4.3.2 Statistical assumptions 


As pointed out before, we will derive the growth rates (B,|1) etc. under 
different assumptions on the distribution of particles. 

Marqusee-Ross assumed that at any time the system described by 
{(Xi, Ri) }is1 is statistically homogeneous and uncorrelated. More precisely, 
this means in terms of the probability distribution that 


Pu(l,-.-5n) Sd apil: (4.15) 


Our main goal however is to allow also for correlations between particles. 
Pair-, triple- and higher correlations in the particle distribution are given by 


q2 (1,2) = p2(1,2) — pı (1)pı (2), 
q3(1,2,3) = p3(1, 2,3) — (pı (1)pı (2)p1 (3) 
+ q2 (1, 2)p1 (3) + q2 (2, 3)p1(1) + q2(1, 3)p1(2)), 


etc.. 
In the following we only retain pair correlations, that is we postulate 


dk =0 for k > 3. 


Moreover, we neglect products of pə in the cluster expansion, such that we 
arrive at 


pn(1,..-,2) = Mipi (i OAs yt (i, DH kiipi (k). (4.16) 
E j>a 


In the large system limit it is convenient to work with number density based 
quantities: 


g2(1,2) := n(n — 1)q2(1,2) = fo(1,2) — =*fi(1) fo(2) 
g3(1,2,3) := f3(1, 2,3) — (etn?) AAA) 
— C2 (go(1, 2) f1 (3) + g2(2,3) f1(1) + g2(1, 3) fi(2)). 


We assume that pair correlations are small, that is 


gj) j) 1/2 
=0 4.17 
ROAD OO D An 
and vanish for large distances, that is 
glij) _ 1/2 y. n us 
ROAD > olo") for € & |X; — X;j| < (5) ; (4.18) 


Due to the good ergodicity properties enforced by (4.18) the spatial average 
of u equals the ensemble average, that is 


3r = u(x) dx) =:% inthe limit n > oo. ; 
$ ua) da = (f (x) d? x) the limit (4.19) 


n n 
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4.3.3 On the expected value of the Green’s function 


As we have seen in Sect. 2.4, one way to express the screening effect is to 
establish that the effective operator of the Laplace operator in domains with 
holes is the Helmholtz operator (see e.g. [CM97] and the references therein). 

We will present here the main ideas of another derivation of this fact, which 
will also illustrate our method by a simplified problem. The link is made via 
the expected value of the Green’s function which agree up to order O(¢'/?). 


i in R3\ Ui>k+1 Pi, 
l = (4.20) 
= in Ui>k+1 Pi. 


Lemma 4.3. (/HNO05aj, Lemma 3.3) 
Under the assumptions (4.16), (4.17) and (4.18) we obtain in the infinite 

volume limit 

1 _ |e-*1| 


ae 
|z — Xıl 


(1) Be ee 1 
(G4? (a) |[1) = O(6'?)min {= eT} 


for all x € R®\P,. 


We indicate the main idea of the proof. Roughly speaking, our claim is that 
(GP (x) |1) is an approximate solution of 


1 
z 
We give here a sketch of the argument; the control of the error terms can be 
found in the proof of Lemma 4.4 in [ANO05a]. 


To this purpose we introduce the charges of Gy) on {OP;}i>2 by 


~A(G{ (x) 1) + (GM (2) | 1) = 4rô(z — X1). (4.21) 


Hi 
a 1 ag” 


qa 
By = 
1, An OP; ôn 





(4.22) 


so that up to dipolar terms 


-AGP (x) = 4n6(a — X1) + X BE) 4rô(2 — Xi). (4.23) 

i>2 
As a first approximation the Green’s function for the system {(R;, Xi) }:>2 can 
be approximated by the Green’s function for the reduced system {(R;, Xi) }i>3 


as follows: i T us = 
Ga) ~ af) @) = Ef Iga) Raa); 


which in view of (4.22) leads to 
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Bi!) ~ -RGP (X2). 

Inserting this into (4.23) (with particle 2 replaced by 7) already yields a form 
similar to (4.21): 

-AGP (x) + X 4r Ri 5(« — Xi) GY? (2) 

i>2 
= -AGP (x) + Y 4r R; 5(x — X;) GFP (Xi) (4.24) 
i>2 

x And(a — X4). 

We now take conditional expectations: 


-A(G (a) |1) + (Y 4rRi 6(@ — Xi) (GP (w) | 1,2) 11) 





E AGP (w)|1) + (So 4m Ri êle — Xi) (GE (@) | 1,4) [1) 
= -A(G (x) |1) + ( 3 AnR; 6(a — X) GFP (x) |1) 
re And (2 — Xi). = (4.25) 


Since our system is nearly decorrelated, we expect 
2 1,2 
(Gy? (w) [1,2) = (GPP (a) 1). (4.26) 


This allows us to appeal to the following argument: In the infinite volume 
limit the removal of one particle is immaterial: 


(GEP (a) | 1) = (GPP (a) |1). (4.27) 
Inserting (4.26) and (4.27) into (4.25) yields 
—A(GSY (x) |1) + (X 4m; 5(« — Xi) | 1)(GYP (2) |1) ~ 4rô(z — X1). (4.28) 
i>2 
Since our system is nearly decorrelated, we expect 
(X ATR; 5(a — Xi) |1) ~ (XC 40 Rj 6(@ — X,)). (4.29) 
i>2 i>2 
In the infinite volume limit, we have 


(X 40 R; 6(x — Xi)) x (X ATR: 6(x — Xi)) 


i>2 i>1 


(4:12) artista = X1)fi( Ri) dR, aX, 


= pork fi (Ri) dR, 


(4.13) 1 


a (4.30) 
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Inserting (4.29) and (4.30) into (4.28) yields (4.21). 


4.3.4 Separation of screening and correlations 


Using the same strategy as in Lemma 4.3 one can also relate the growth rate 
of particle 1 conditioned on a finite number of particles to the system where 
this finite number of particles has been removed. 

In analogy to the last section we denote by ul") the solution of the 
elliptic boundary value problem (2.2), (2.3) in the system where particles 
1,...,k have been removed. Our crucial result in the derivation of first order 
corrections to LSW is the following hierarchy for the expected growth rates. 


Lemma 4.4. (/HNO05a/, Lemma 3.5) Under the assumptions (4.16), (4.17) 
and (4.18) we obtain 


m| 2 


+) (1 Riu (X1)|1)) +o), (4.31) 


Ge -( ee a) 
ea Sco ae (4.32) 
1,2 
(Aado na) sarn 
(Bı |1,2,3) 1+ -pre — Fae E 
(Bo |1,2,3) | =| -2P 14% -Re P 
er set eee Ite (4.33) 


1— R, (u23) (X1)|1,2,3 
1 — Re (ul?8) (Xp) |1,2,3) | + 0(6"/?). 
1 — Rg (u-?.3)(X3) | 1, 2,3 
Like in LSW, that is Bı = 1 — Rıŭ, the formulas in Lemma 4.4 relate the 


conditional particle growth rates to mean-fields (u‘)(X,)|1), etc. The new 
elements are the factors 


d 
Re 1+4 -Pe 
EE a Nae. 5 
£ -7e g 1 + = 
12 E 


which capture screening. As opposed to the LSW-theory, which overestimates 
the distance between particles, these screening factors reflect the fact that the 
interaction range is finite and contributes as an amplification factor in the 
growth rates. 

The effect of correlations in the particle distribution are contained in the 
expressions for the mean-field (u® (X1) | 1), (uw?) (X1) | 1, 2) etc.. 
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4.3.5 Independent particles 


The results of the previous section now allow an easy derivation of the theory 
for independently distributed particles which was first derived by Marqusee- 
Ross. Our approach avoids the non-converging series in the monopole ap- 
proximation, instead we directly calculate finite particle statistics. In fact, 
our hierarchy of formulas, Lemma 4.4, allows us to make efficient use of the 
assumption of statistical independence (4.15). 


Proposition 4.5. (/HNO05a], Proposition 3.7) Under the assumption (4.10), 
(4.11) and (4.15) we find in the infinite volume limit 


(Bı) =0, (4.34) 


(B,|1) = (1 + 2) (1 — Riu) + 0(¢'/?). (4.35) 


It is straightforward to derive (4.35) from formula (4. o in Lemma 4.4. In- 
deed, since ul) does not depend on particle 1, we have (u® (x) |1) = (u (x)). 
Now, since particles are statistically independent, we find (u')(a)) = (u(x). 
However, the infinite system {(R;,Xi)}i>1 is statistically equivalent to 
{(Ri, Xi) }i>2. Hence, due to translation invariance we find that (u(x)) is 
constant and recalling (4.19) we finally have (u“(a)|1) = @. Since particles 
are identically distributed we obtain (4.34) directly from (4.2). The mean field 
ù is determined by (4.34) & (4.35) and given by 


14 8 1 
ax g 1/2 
ü = y+ wD + TR? iF (4.36) 


The system (4.14), (4.35) and (4.36) is precisely the Marqusee-Ross theory. 


4.3.6 Weakly correlated particles 


In this section, we rederive the Marder theory under assumptions (4.16), (4.17) 
and (4.18) on the statistics of the particles. 


Proposition 4.6. (/HNO05a], Proposition 3.9) 
Under the assumptions (4.10), (4.11), (4.16), (4.17) and (4.18) we find in 
the infinite volume limit 





(Bi) = 0, 
Ry z 1/2 
(Bı |1) = (1+ ral (1 — Ri(a@ + dur)) + o(6/?), (4.37) 
Ri 
(Bı |1,2) = (1+ ral (1 — Ri(a + du; + ôu2)) 
Te roe (Ree), (4.38) 
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where for i = 1,2 


ly~X1| 
e € _) go( Ri, Xi, R, y) 3 1/2 
bus = | (= ra) PEED aray = olg : 4.39 
re ail ) fi (Ri) en) ed 
The mean field @ is as in the Marqusee-Ross theory implicitly determined by 


(Bı) =0. 

The last term in (4.38) quantifies how a large particle 2 will negatively 
affect the growth rate —B, of particle 1: Particle 1 will grow below average. 
Hence the large particle 2 over the course of time affects the particle cloud in 
its neighborhood, as described by (4.14). The quantity go(Ro, X2, R, y) keeps 
book of this impact. which leads to the deviation dug in the mesoscopic mean 
field from its average value ti as described by (4.39). The equation (4.37) (with 
particle 1 replaced by particle 2) shows how this in turn influences the growth 
rate of particle 2. 

Up to an implicit term of order O(¢), (4.37), (4.38) is indeed identical 
with Marder’s result [Mar87, (2.31)]. Our derivation differs however from 
Marder’s in the initial assumption. Marder postulates, cf. [Mar87, (2.25)], the 
following relation between conditional expected charge distributions 


(So B,o(x — X;)|1,2) = on Biô(x — X:) |1) + DD Biô(x — X;) |2). 


i>1 i>1 i>1 


This assumption seems unsatisfactory, since it is an assumption on the so- 
lution {B;};>1. We replace this assumption on the solution by the assump- 
tion on the cluster expansion (4.16), which is an assumption on the data 
{(Ri, Xi) }e>1- 


4.3.7 Is Marder’s model self-consistent? 


The model derived in Sect. 4.3 has still a snapshot perspective in the sense 
that we derive the results under the assumptions (4.16) and (4.17) which are a 
priori not preserved under the evolution. One can show (cf. [HNOO05a], Ch. 3.6) 
that the assumptions are self-consistent for times of order (R)° (the natural 
time scale) for the ” bulk” of particles. However, it is argued in [NV06a] that 
the assumptions are not self-consistent in a boundary layer near the largest 
particles in the system. 

The basic reasoning goes as follows. Suppose one solves the model (4.14), 
(4.37), (4.38) for uncorrelated initial data, where fı(Rı,0) has compact sup- 
port. Consequently, the support of fo = fo(R1, Re, X1— X2, 0) is also compact 
in Rı and Rə. However, the evolution of Rı and R2, determined by (4.38), 
depends on space due to the term e~“2/§/d,2 in (4.38). Therefore, particles 
Rı and Rə which are at a distance smaller than € evolve differently than par- 
ticles Ry and Rə which are at a distance much larger than £. As a consequence 
also the support of fə in Rı and Rə varies in space. It is not too difficult to 


Correlations, Fluctuations and Collisions in Ostwald Ripening 525 


see (cf. [NVO6a]), that regions in the variables (R1, R2, X) develop where fo 
identically vanished but where fı(Rı) and fi(R2) not. In these regions gz is 
of the order fı(Rı)fı(R2), such that (4.17) is not satisfied. 


5 Fluctuations of largest particles 


For the reasons described in the previous section one cannot assume that 
correlations are small around the largest particles. Consequently, a different 
kind of correction has to be derived. The onset of large correlations for the 
largest particles is a kind of hydrodynamic regime which has to be described by 
a suitable boundary layer. Such a model has been derived in [NV06a]. A main 
ingredient is a closure relation which expresses f by evaluating fı at a certain 
shift in the variable R,. The second main task is then to explicitly compute 
the shift term to leading order. Similarly to the derivation of the results of 
Sect. 4.3, a main idea in the analysis is to express relevant quantities in a 
system of particles through the ones in a system where a particle has been 
removed. The final step consists in a Taylor expansion of fı that leads to a 
second order equation for the evolution of fı. 
The resulting model has the following form: 


Of; (Rı,t) ð 1 1 

a; -m (car mi f(t) (5.1) 
0 t Je W (s,t,@ ) raat ds Of: (Ri, t) 

ORs i o Rr (s,t, Ry) Ghia) ORs 








where the function W satisfies the integral equation 


Wista)+ pre | Kent ([¢-vow ounav) dr (5.2) 


=4nG (x,t) | (1- wee (1- aa) nnak 


and the kernel K is given by 


OR1(s,t,R) 


ES OR 
KD fa comma) Roa ha pom (RAR (6) 


Here G is as in (2.9) (with Xo = 0) and Rz = Rz(s,t, R) satisfies the LSW 
equation, that is 3s Rr = pr(Rxi/(R) — 1)), with Ry (t,t, R) = R. 
'L 
If we investigate the order of terms, one finds that (5.2) implies that W 
is of order z This in turn gives that the ” diffusion coefficient” , the term in 


front of 2 am is also of order 4. Compared with the terms on the left-hand 


side, we find that the relative size of the diffusive term if indeed of order 
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“ = O(¢'/?). However, close to the end of the support, where fı is small, 


e.g. like a power law or an exponential, the term 2 f becomes large to fı and 


the second order term is of the same order as fı in a boundary layer of size 
¢'/4 at the end of the support. This effect is exactly the same as in other well- 
known situations where boundary layer develop, such as in the Navier-Stokes 
equation with no-slip boundary conditions. 

In [NV06a] also self-similar solutions to (5.1)-(5.3) are investigated. By 
asymptotic expansions a self-similar solution is constructed, which is a per- 
turbation of the LSW self-similar solution. The perturbation has the shape of 
a Gaussian tail and induces correction to the mean particle size of order ¢!/4. 
We find 





6 Collisions versus Correlations and Fluctuations 


We already pointed out in Sect. 2.2 that particles may collide during their 
evolution and that in a discussion of corrections to the LSW model such 
events have to be taken into account. At first glance, this effect seems to be of 
higher order than correlations, since the number of particles per unit volume 
which are involved in collisions is of order pġ, hence the correction of the LSW 
model due to collisions should have relative size of order @. 

However, the situation is more subtle, because collision of particles of 
medium size produces a large particle which might dominate the long-time 
behavior. In fact, ideas along these lines have already been formulated by 
Lifshitz and Slyozov in their original paper [LS61]. They suggest that a co- 
agulation term with a kernel which is additive in the volume variable should 
be added on the right hand side of the LSW model. It is predicted that the 
corresponding correction to the self-similar LSW solution is of order Inż. 
The main reason for this large correction is due to the kinetic character of 
collisions. This implies that the fraction of particles which are transported to 
the super-critical regime is of order ¢, whereas the diffusive correction due 
to fluctuations in (5.1) transports only the largest particles to a supercritical 
regime. This effect is exponentially small in @. 

In a forthcoming article [NV06e] we will investigate the derivation and 
analysis of the ” coagulation model” by a mathematical analysis and compare 
the size of the effects with the ones induced by (4.38) and (5.1) respectively. 

This discussion highlights that there are many analogies in the analysis 
of Ostwald ripening to the mathematical analysis of gas dynamics. In gas 
dynamics there exist two relevant limits to obtain kinetic equations for the 
distribution of particle positions and velocities, namely the Boltzmann and 
the Bogoliubov [AP81] limit. The assumptions in the Boltzmann limit are 
that the range of interactions between particles is small compared to their 
average distance. In this case it is possible to reduce the dynamics to an 
equation for the one-particle distribution function, the well-known Boltzmann 
equation. In the second limit, the range of interactions between particles is 
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large compared with their average distance. The dynamics can be simplified 
under the assumption of that the ratio between average potential energy and 
kinetic energy is small. The resulting model are equations for the one- and two- 
particle distribution function. In Ostwald ripening, collisions between particles 
play a role which is analogous to collisions of gas particles in the Boltzmann 
equation, whereas the results in [Mar87, HNO05a] might be considered as a 
generalization of the Bogoliubov method. 
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Summary. We consider an ensemble of classical particles modelled by means of 
a continuity equation for a distribution function which is coupled back to the self- 
induced fields. For such infinite dimensional systems simpler effective equations are 
derived in the limit c — oo. A main emphasis is on higher order approximations, in 
particular on the first post-Newtonian order where radiation starts to play a role. 


1 Introduction 


The most precise existing theory of gravitation, the theory of general relativ- 
ity, predicts that certain astrophysical systems, such as colliding black holes 
or neutron stars, will give rise to gravitational radiation. There is a major 
international effort under way to detect these gravitational waves [Bra04]. In 
order to relate the general theory to predictions of what the detectors will 
see it is necessary to use approximation methods since the exact theory is 
too complicated. The mathematical status of these approximations remains 
unclear and only very partial results exist. Therefore it is useful to start with 
model problems. One option is the relativistic Vlasov-Maxwell system which 
plays an important role in plasma physics. Although the field part of this 
system is electromagnetic and spacetime is flat, such a model is already fairly 
difficult. It is often used to gain a better understanding of the mathemati- 
cal structures involved in more realistic gravity models. A further option for 
a simplified model is the scalar theory of gravitation, as described by the 
Vlasov-Nordstrém theory [Cal03]. It has already been considered as a model 
problem for numerical relativity in [ST93]. 

Among the approximation methods used to study gravitational radiation 
those which are most accessible mathematically are the post-Newtonian ap- 
proximations. Some information on these has been obtained in [Ren94, Ren92] 
but further rigorous progress seems difficult at this point. For that reason 
it seems to be useful to investigate (from the viewpoint of approximation 
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methods) the two systems presented above, i.e., Vlasov matter coupled to 
the Maxwell fields and Vlasov matter coupled to a scalar gravitational field 
governed by the Nordström equation. We shall explain the post-Newtonian 
expansion of the Vlasov-Maxwell system in some detail in Sect. 2 and sketch 
our results concerning the Vlasov-Nordstrém system in Sect. 3. 


Remark: Our contribution is one part of the research project jointly with 
G. Panati, H. Spohn, and S. Teufel within the Schwerpunkt. The second part 
will be covered in the separate contribution [PSTO6]. 


2 The Vlasov-Maxwell system 


The Vlasov-Maxwell system from kinetic theory models the evolution of a 
plasma or gas composed of many collisionless particles which move under the 
influence of their self-generated electromagnetic field. For the sake of simplic- 
ity we assume that there are only two different species of particles with mass 
normalized to unity and charge normalized to plus unity and minus unity, 
respectively. The particle distributions on phase space are modelled through 
the nonnegative distribution functions f* and fT, f* = f*(t,x,p), depend- 
ing on time t € R, position x € R, and momentum p € R?. It is assumed 
that collisions between single particles are sufficiently rare such that they can 
be neglected. Therefore all forces between the particles are mediated by the 
electromagnetic fields. The dynamics are governed by 


Oft+p-Veft+(E+ cp x B)- Vpf* =0, 





























cV XE = —-O,B, cV x B= QE +4rj 
(RVM.) 
V. E = 4rp, V:-B=0, 
p= [ft =f) ae, j= four Pye 
Here 
p= (1+c 7p?) ip E RS (2.1) 


is the relativistic velocity associated to p. The Lorentz force E + ctp x B 
realizes the coupling of the Maxwell fields E = E(t, x) € R? and B = B(t, x) € 
R? to the Vlasov equation, and conversely the density functions f* enter the 
field equations via the scalar charge density p = p(t, x) and the current density 
j = j(t,x) € R°, which act as source terms for the Maxwell equations. The 
parameter c denotes the speed of light for given units of time and space of 
the physical system. In order to state the Cauchy problem for (RVM,) initial 
data for the densities and for the fields have to be prescribed, 


f*(0,2,p) = fE (x, p), E(0,x) = E°(a), B(0,£)= Bea). (2.2) 











Henceforth we treat the speed of light c as a parameter and study the behavior 
of the system as c — oo. It will be explained below that after a suitable 
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rescaling the fields are slowly varying in their space and time variables. Thus 
the limit c — oo corresponds to an adiabatic limit and to slowly moving 
particles. Our general goal is to establish conditions under which the solutions 
of (RVM.) converge to a solution of an effective system. For the Vlasov- 
Maxwell system the first result in this direction was obtained in [Sch86], where 
it has been shown that as c — oo the solutions of (RVM_) approach a solution 
of the Vlasov-Poisson system at the rate O(c~'); see [AU86, Deg86] for similar 
results and [Lee04] for the case of two spatial dimensions. The respective 
Newtonian limits of other related systems are derived in [Ren94, CL04]. 

It was one aim of this project to replace the Vlasov-Poisson system by 
other effective equations to achieve higher order convergence and more pre- 
cise approximations. In [BK05] this led to an effective system whose solutions 
stay as close as O(c~?) to a solution of the full Vlasov-Maxwell system. In the 
context of individual particles, this post-Newtonian (PN) order of approxima- 
tion is usually called the Darwin order; see [KS00, Spo04] and the references 
therein. We also mention that weak convergence properties of other kinds of 
Darwin approximations for the Vlasov-Maxwell system have been studied in 
[DR92, BF*03]. 

In the next order, and in analogy to the case of individual particles [KSO1], 
radiation effects play a role for the first time. Therefore the well-known prob- 
lems related to the existence of unphysical solutions, usually called “run-away 
solutions”, are expected to turn up [Jac99]. For individual particles these 
problems can be resolved rigorously by restricting the dynamics to a suitable 
center-like manifold in the phase space; see [KS01, Spo04]. Since the phase 
space of the Vlasov-Maxwell system is infinite dimensional (as densities are 
considered), it is clear that several new mathematical difficulties have to be 
surrounded in this step. In [Bau06b] we determined effective equations for the 
Vlasov-Maxwell system on the center manifold, which led to a slightly dis- 
sipative Vlasov-like effective equation, free of “run-away” solutions, and we 
proved that solutions of these equations stay as close as O(c~*) to a solution 
of the full Vlasov-Maxwell system. 

Compared to systems of coupled individual particles, for the Vlasov- 
Maxwell system one immediately is faced with the fact that so far only the 
existence of local solutions is known in general. These solutions are global 
under additional conditions, for instance if a suitable a priori bound on the 
velocities is available; see [GS86]. This means that from the onset we will have 
to restrict ourselves to solutions of (RVM.) which are only defined on some 
time interval [0,7] that may be very small. On the other hand, in [Sch86] it 
has been shown that such a time interval can be found which is uniform in 
c > 1, so it seems reasonable to accept this limit. 

In Sect. 2.1 we first carry out a formal expansion of (RVM,) in c7! as 
c — oo. It turns out that up to the Darwin order this approximation yields 
the correct effective system. However, for the next order this naive procedure 
does not yield the optimal result. To get a clue on how an improved approx- 
imation may look like we derive the leading order dipole radiation term in 
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Sect. 2.2, and thereafter we establish an improved system (called the radia- 
tion approximation) which gives a better approximation to the full Vlasov- 
Maxwell system. More details on this are elaborated in Sect. 2.3. Finally a 
comparison of the continuous models to the individual particle models is done 
in Sect. 2.4. 


2.1 The naive post-Newtonian expansion 


We adopt the definition of a post-Newtonian approximation from [KRO1b]; 
see also [Ren92] for the Einstein case. Thus the matter and the fields are de- 
scribed by a one-parameter family (f*(c), E(c), B(c)) of solutions to (RVM,), 
depending on the parameter c € [co, o[. This means that (f*(c), E(c), B(c)) 
describes a family of solutions of physical systems which are represented in 
parameter-dependent units, where the numerical value of the speed of light is 
given by c. A more conventional physical description of the post-Newtonian 
expansion would say that in a fixed system of units the occurring velocities 
are small compared to the speed of light. Taking this viewpoint means that 
we consider (RVM,) at a fixed c (say c = 1) by rescaling the prescribed non- 
negative initial densities f°*+, for which we suppose that f% € C% (R? x R3) 
have compact support. To be more precise, denote U = f f pf°*(x,p) dx dp, 
where ĵ is taken for c = e~'/?; cf. (2.1). Then @ is viewed as an average veloc- 
ity of the system. Then we introduce f°* (a, p) = €3/? f° *+(ea,e~1/2p) and 
consider f°*** for c= 1. It follows that 


v= for*<(e.p)dedp= ve | | bF) dv dp = veo, 



































i.e., the systems with initial distribution functions f°'** have small velocities 
compared to the systems associated to f+. Under this scaling the masses re- 
main unchanged, as f f f°*<(x,p)dadp = f f f°*(a,p) dx dp. Next observe 
that (f+, E, B) is a solution of (RVM,) with c = e~'/? if and only if 




















FP (t, 2, p) = 29/9 f*(e77t, ex, ep), 
EF (t,x) = e? E(e9/*t, ex), (2.3) 
BY (t,x) =e? B(e?/"t, ex), 


is a solution of (RVM_) with c = 1. By definition of the rescaled fields these 
fields are slowly varying in their space and time variables. Thus the limit 
c — oo corresponds to an adiabatic limit. Henceforth we will return to the 
original formulation and consider the system as c — ov, treating the speed 
of light c as a parameter. However, due to the rescaling outlined above all 
theorems can also be formulated in a parameter independent fashion. In that 
case the value of c is fixed, say c = 1, and the initial data have to be modified 
according to (2.3); see [BK05] for details. 

We start with a formal expansion of all quantities occurring in (RVM,) in 
powers of c71, 
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F= fo +e A +e f +e fg Hise +4 

E = Eo + tE +e? E +e3%Fst..., 

B = Bo +c 'Bı +c "Bote 3B3t...; (2.4) 
p = po ++p +e? p +e? t... 












































j = jo +e Ate rj + eja tnn. 


In addition, also the initial densities are assumed to allow an expansion as 
FOE = fo tol” +... Finally p = p — (c~?/2)p?p +... by (2.1), where 
p? = |p|?. These expansions can be substituted into (RVM.). Comparing 
coefficients at every order gives a hierarchy of equations for the coefficients. 
The equations at order k will be addressed as the k/2 PN equations, and 














k 


k k 
Poe Sees BENS} ee, BPN N eiB, 
j=0 j=0 j=0 








is the k/2 PN approximation. This notation is used due to the fact that in 
the context of general relativity post-Newtonian approximations are usually 
counted in orders of c~?. 

At order zero the well known Vlasov-Poisson system of plasma physics is 


obtained, 











Oi fei +p: Vaf? + Eo: Vp le =0, 





Eo(t, £) = — J |z| T?Z po(t, £ + z) dz, 


po = | (at — fo ap, 
fo (0, x, p) T fo (x, p), 


(VP plasma) 








where Z = |z|~z. Note that the degrees of freedom of the electromagnetic 
fields up to this order are lost, reflecting that the limit c — oo is singular and 
the hyperbolic field equations become elliptic. As mentioned above, this 0 PN 
approximation is made rigorous in [Sch86]. 

Concerning a general k we assume that the lower order coefficients have 
already been computed. Then the fields at order k have to solve 


V x Ex = —OBx-1, V - Ek = 4T Pk, 
V x Bk = 0Ek—1 + 4Tjk—1, V- B = 0. 


The Vlasov equation to that order is 





























ify +p: Vefk + Eo- Vole = F Ek. Volo + Ry : 























where the Ry can be calculated from the known quantities f°, Vaf, VpJfj > 
Ej, and B; for j = 0,...,k—1. A special feature of this hierarchy is as follows. 
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If we assume for the initial data that f;’~ = 0 for all odd k, then using the 
explicit form of Rẹ it can be shown that we can set 





foi =0, Fai =0, By =0 (2.5) 


consistently, for l = 0,1,2,.... This simplification will be employed through- 
out. To solve the equations for (fk, Ek, Bk), we observe that once Ep is known, 
then fp can be calculated using characteristics. Note that for all orders k the 
characteristic flow is determined by the vector field (p,+o). On the other 
hand, if the fọ are known, then px and jp are fixed. Using the vector identity 
—V x Vx +VV- = A, we can rewrite the field equations as 











Eo, = 40 AT! (V pok + Orjor—2) + A+ (0? Ezk—2), 
Bək+1 = A7! (0? Bop_1) — 42 AT! (V X Jor), (2.6) 


where quantities carrying a negative index are understood to be zero. Assum- 
ing that all densities are compactly supported we can solve these field equa- 
tions. Of course without boundary conditions the solutions are not unique, 
and at least for higher orders they will not vanish at infinity. Nevertheless, if 
we take those fields, then the coupled equations can be solved by a fixed-point 
iteration for Æp. Thus on a formal level the (naive) PN approximation scheme 
is well defined. 
According to this scheme, B is given by 


By(t,x2) = i |z|~*z x jo(t, £ + z) dz, (2.7) 





where jo = f p(fo’ — fo ) dp. The couple (fF, E2) is the solution to 

















Of +p: Vaf? + Eo Vp fy 
= 3p? p: Vafo + (Ea +p x Bi) Velo 


E(t, £) = 5 [zotet 2) dz- faote) dz 


- f lz p(t, 2 + 2) dz, 














II 


p2 


JO- i)o, 
f2(0, x£, p) = f3 (2, p). 





(LVP plasma) 
Thus the 1 PN approximation (Darwin approximation) corresponding to k = 2 
is 

















JEVEN = f +e ATF, EiPN = Eo + c7? Es, B! PN z c1 Bi. (2.8) 


This Darwin system is Hamiltonian in the following sense. If the conserved 
energy 
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e= | | VRPT + raed E fees Bae 


of (RVM,) is expanded according to (2.4), then the Darwin energy Ep = 
Ep, kin + Ep, pot is obtained. Explicitly its kinetic energy and potential energy 
parts are given by 


Sou = i / [(p?/2 — 7 2p4/8) (ft + fr) + 7 2p2/2( FF + f7 )] de dp, 
Eppo = + fe + 2c~* Eo - Ey +c? B3 de. 


It can be checked that Ep is conserved along solutions of the 1 PN approxi- 
mation. The approximation properties of the Darwin system w.r. to solutions 
of the full Vlasov-Maxwell system are investigated in [BK05]. In this paper it 
is shown that if we adapt the initial data (2.2) to suit the initial data of the 1 
PN approximation, then the solutions are tracked down with an error of order 
c73. Hence the naive post-Newtonian expansion is valid up to this order. 

For the next level (k = 3 or 1.5 PN), Bs = A~1(0?Bi) — 47 A7!(V x j2) 
and also By = —4nA71(V x jo) by (2.6). Thus B3 = —4r7A~?(0?V x jo) — 
4nA~1(V x jz) allows for the solution 


1 
B3(t,x2) = 5 fle O2V x jolt, z + z) dz+ fv x jalt, £ +z)dz, (2.9) 


where 


n= [w(t - f3)- (p?/2)p (ft — fod 


It follows that freer = fF te heen = fo Bee = Eo + 

c? Ez +c 3E3 = E'PN and BLS PN = et Bi + eB; = BYPN + e3 Ba, 
ie to (2.5) and (2.8). Therefore the energy Ep from above does not have 
to be changed in comparison to the 1 PN order. Hence at the 1.5 PN order 
(corresponding to c7?) we would obtain a Hamiltonian system with no effects 
due to radiative friction visible. This suggests that the naive post-Newtonian 
approximation has to be improved in order to resolve such effects which are 
indeed present in the system. In order to get a clue on how such a refinement 
should look like it is useful to study the energy which is radiated to future 
null infinity by the full Vlasov-Maxwell system. 























2.2 Dipole Radiation 


The starting point for the following calculation is the local energy conservation 
Oe +V-P = 0 for classical solutions of the Vlasov-Maxwell system. The energy 
density e and the momentum density P are given by 


elta) = è | YTE YS + S mp) dp + (IE a) + BE ®)P), 


P(t, £) = a for + f~ )(t, x, p) dp + D Elta) x B(t, x). 
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Defining the local energy in the ball of radius r > 0 as €,.(t) = eee e(t, x) dx, 
this conservation law and the divergence theorem imply that 4 €,(t) = 
Siew - P(t, x) do(x), where z = |x|~tx denotes the outer unit normal. 
Our assumptions on the support of the distribution functions are such that 
the contribution of f p(f*+ f~) dp to P vanishes for |x| = r large. Hence we 


arrive at d 
C 2 
Fol) =e [ (BX B)(t,2) do(2). 


Therefore the energy flux radiated to null infinity at time t is obtained as 


lim & Z- (B x E)(t+c"'r, x) da(2), 


r—oo 4r ne. 


where t+ ctr is the advanced time. In [BL*06, Thm. 1.4] it is shown that 
for suitable solutions of the Vlasov-Maxwell system which are isolated from 
incoming radiation in the limit c — oo the total amount of radiated energy is 
given by 

2 

303 
where D is the dipole moment of the Newtonian limit (VPpiasma), defined as 
D(t) = f xpo(t, x) dx; see [BL*06, Thm. 1.4] for the exact statement and the 
remarks below for the retarded system (re RVM.) which models “no incom- 
ing radiation”. This result yields a mathematical formulation and a rigorous 
proof of the Larmor formula in the case of Vlasov matter. Returning to the 
approximations, we should therefore introduce into the effective equation a 
radiation reaction force causing this loss of energy. As already suggested in 
[KROla, KRO1b] we thus modify the Vlasov equation of the Newtonian dis- 
tribution by incorporating a small correction into the force term as 


ID()/?, (2.10) 














+ $ Dp ce on 
dfo +p: Valo + (a + ab) -Vp fo =O: (2.11) 


The additional term is the generalization of the radiation reaction force used 
in particle models; see [Jac99, (16.8)]. We also note that for this system the 
quantity 


EO = 5 | [POE + REE) dedp+ = f Eola)? dx 
De r . 
SEE E 


7E (t)- D(t) (2.12) 
is decreasing. More precisely one obtains 4 Es(t) = -l| Ď(t)|?, cf. (2.10). 
The subscript S' refers to Schott who considered similar quantities for particle 
models; see [Spo04]. Although Es has no definite sign, its decrease can be 
attributed to the effect of radiation damping. If instead of Es the usual positive 
energy Evp = 5 f fv? (fo + fo ) dx dp + $ Jf |Eo|? dx of the Vlasov-Poisson 
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system is considered, then along solutions of (2.11) the relation 4 Evp = 
(D - D — | D|?) is obtained which has no straightforward interpretation. 

Thus we consider (2.11) to be a promising candidate of an effective equa- 
tion for the relativistic Vlasov-Maxwell system. However, one immediately 
runs into the problem that initial data have to be supplied for D(0), D(0), 
and Ď(0), as third derivatives of D occur in the equations. Since only D(0) 
and D(0) are determined by f5’* and since there is no obvious way to extract 
the missing information from the approximation scheme, additional degrees 
of freedom seem to be generated. This phenomenon is also known from the 
theory of accelerated single charges and leads to a multitude of unphysical (so- 
called run-away) solutions. In [KS01] it has been observed that in the particle 
model this problem has the structure of a geometric singular perturbation 
problem, and the “physical” dynamics are obtained on a center-like manifold 
of the full dynamics. 

In order to adopt this language to the model under consideration, we 
assume that we are supplied with a (local in time) classical solution (fj, Eo) 
of (2.11) and assume that the support of f5-(¢, -,-) remains compact for all t in 
the interval of existence of the solution. We define the bare mass by M(t) = 
I [Cfo + fo )(t, x, p) dx dp. Then (2.11) yields mass conservation 0,M = 0 
as well as charge conservation for both species rpg + V - jg = 0, where 
jo = J pfo dp and pg = f fo dp. From D = f xpo dx and (2.11) we then find 
that D = I [vp (fo — fo ) dx dp and D = D] + z MD, where 






































Det) = J / Bo(t,x)( fet + fo )(t, £, p) de dp. (2.13) 


Defining y = D and 7 = 33M, this can be rewritten as y = DP + nD. 
Putting 




















F*(fS,y) = —p: Va fo F (Eo + My- DPI). Vp fe, 
G(fe.y) =y— D”, 


it follows that (2.11) can be recast as the singular perturbation problem 


ft = F*(fE.y) 
ae, scPP, 


In contrast to [KSO1] we are dealing with a phase space of infinite dimension. 
Thus the proof of the existence of an invariant manifold is hard. We shall 
return to that question in a forthcoming paper. For the moment we shall take 
the existence of a smooth invariant manifold for granted and assume that it 
is given as a smooth graph hy = h,(f$), where hy is defined on (a subset of) 
C§ (R? xR) x Cg? (IR? xR?) and takes values in R3. For the moment fẹ denotes 
(fo, fo’), and similarly fo = (fF, fo) and F = (F+,F7). The manifold 
Mn = {(f6;hn(f6))} is invariant under the flow of (SGPP,,) if the solution of 
(SGPP,,) subject to the initial conditions (fo(0), y(0)) = (f5, hn (fG)) satisfies 
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y(t) = hn (folt, »:)). (2.14) 


We want to determine a system of Vlasov-Poisson type which is a good ap- 
proximation of the dynamics on the manifold. For this reason we assume that 
we can expand hy in 7 about 0 as hy = ho +nhı + O(n’). Setting n = 0 in the 
second relation of (SGPP,,), 0 = G( fo, ho(fo)) = ho( fo) — DP is obtained, so 
that ho = DP); note that DP! depends on fo. Thus 


ny = G( fo, hol fo) + nhi (fo) + O(n?)) = hol fo) + nhi(fo) + O(n?) — D” 
= nhi ( fo) + O(n’). 


On the other hand, differentiating (2.14) yields 
ny = n (hhl fo), fo) = n (hol fo), fo) + O(n’), 
and consequently hi (fo) = (A) (fo), F(fo, ho(fo))) by (SGPP,). Explicitly, 
(ho (fo), F( fo, ho(fo))) 
= MU): f f (=p: Velde = fo) - Bo Vote + fo)) C+.) de dp. 


From (2.14) and the above it follows that y = hy(fo) = ho(fo) + nhi(fo) + 
O(n?) = DP + n (D (fo), F(fo, DAI (fo))) + O(n?). After a straightforward 
computation we obtain 








(DPI (fo), F(fo, D?(fo))) = DM, (2.15) 
where 
D®\(t) = 2 | (H* (t,0)j5 (t,2) — Ho (t,2)i6 (t,2)) de, (2.16) 
defining 
H*(t, x) := f |z| ’(—37 8 Z + id)py (t,x + z) dz € R°*?. (2.17) 


Note that H(z) = —3(Z 9 Z) + id is bounded on R? \ {0}, homogeneous of 
degree zero, and satisfies fiaz H(z) do(z) = 0. per DPI is close to D, 
since D = 4 (DP + MD) = 4D” + O(c) = DE! + O(c?) by (2.15). 


3) 


We introduce the “reduced radiating a system” as 














$ + 2 i 
DSE +p: Vole + (Eo + =z D) Vp fet =0, 


Eo(t, £) = - fier 2z polt, £ + z) dz, 
(rrVP-) 


os jot ~ fr) dp, 


o (0, x, p) F fo ™ (z, p), 
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where DÌ] is defined by (2.16) and (2.17). The next proposition addresses the 
existence and uniqueness of local classical solutions of (rrVPe). Furthermore, 
it provides some useful estimates. For the initial data we assume 





f= E Ce (R? x R3), Aa > 0, 
fo (z, p) =0 for |z| > ro or |p| > ro, Io lliwa < So, (2-18) 








with some ro, So > 0 fixed. 





Proposition 2.1. If fo’ satisfies the above hypotheses, then there exists 0 < 
T < œ such that the following holds for c > 1. 





(a) There is a unique classical solution (fo, Eo) of (rrVP.) existing on a time 
interval [0,T.| with T < Te < oo. In addition, 4 pel = DE), where DEI 
is defined by (2.13). 

(b) For every T < T there is a constant M,(T) > 0 such that for all0 <t <T, 





fo (t,2,p)=0 if |x| > Mi(T) or |p| > Mi (TZ). 





(c) Even fF € C™ holds, and for every T < T there is constant Mo(T) > 0 
such that for al O <t<T, 





|° fa" (t, £, p)| + | Ho(t, x)| + |07 DP\(t)| < M2(T) 
for every x € RÌ, pe R3, jal < 4, 8 <2, andy <1. 


See [Bau06a] for the proof. Note that the constants T, M,(T), and M2(T) do 
only depend on the “basic” constants ro and So. In particular T, M\(T), and 
M2(T) are independent of c. Since the second moment f f p?( for + fo ) dx dp 
cannot be bounded a priori by using energy conservation, it seems difficult 
to prove global existence of classical solutions to (rrVPe). Recall that both 
methods yielding global existence of Vlasov-Poisson type systems essentially 
relied on such an a bound; see [Pfa92, Sch91, LP91]. 

By means of (rrVP,) the approximations are improved by replacing at 
order zero the solution to (VPplasma) by the solution to (rrVP.). Thus let 
(fo, Eo) be the solution of (rrVP.); note that this solutions depends on c, 
as opposed to the solution of (VPpiasma). Next define Bi, (fs, E2), and Bs 
according to (2.7), (LVPpiasma), and (2.9), respectively. We remark that so- 
lutions to (LVPpiasma) do exist on [0, Te[ (where Te is from Proposition 2.1) 
and enjoy the usual properties, provided that both f9’~ and f3’™ satisfy the 
assumptions (2.18); see [Bau06al. 

In the following section we are going to explain that 


F Safe Pe fa 
ER = Eo + c? Ep + (2/3)e3 DP, (2.19) 
BÈ = c1 B; + c° B3, 
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yields a higher order pointwise approximation of (RVM_) than the Vlasov- 
Poisson or the Darwin system considered in [BK05]. We call (2.19) the radia- 
tion approximation. In the terminology of post-Newtonian approximations it 
is the 1.5 PN approximation. 

Using the Vlasov equation and integration by parts the following formulas 
are found. 


Proposition 2.2. The fields E® and B® can be written as 


ER(t,2) = - fier a tie one ede 


se f feaa p- rebut -Ate + 2.0) dzdp 
-e f fiat{eoz+r} 


(Bolt, + 2) + (2/3) DPH) (fF + FTE, 2 + 2, p) dz dp 


+5 e3 DBI), (2.20a) 
and 
Rea) =e f f AEAEE- IN E+ 2,0) dedp 
-žes | f le eR AD\ fit -N E+ zp) de dp 
1 
T 


a 
(2 


(Eolt,z + z) + (2/3)? DË] ©) (fo + fo )(t, £ + z, p) dz dp 


-e fan (H(t, + z)jo (t,£ + 2) 


— H (t,x + z)jg (t,£ + 2)) dz. 
(2.20b) 


At the end of this section we want to mention that there is another variant of a 
damped Vlasov Poisson type system, considered in [KRO1a, KR01b]. While for 
that system a global solution theory is at hand, the authors did not compare 
approximations based on their solutions to solutions of the full system. 


2.3 1.5 PN comparison dynamics 


For the 1.5 PN comparison dynamics, as outlined in Sect. 2.2, the initial 
data (fo, fs’) are given. The field quantities are to be computed from 
the resulting densities (fF, f7) by means of (2.20a), (2.20b), FÈR = fF + 
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c`? fs, and (2.17). In comparison to that, for the Cauchy problem of (RVM.) 
the initial fields Æ° and B° also have to be specified; see (2.2). Thus it is 
the question for which choice of initial data (2.19) yields a good comparison 
dynamics and for which not. Certainly it is possible to choose initial data for 
the fields such that the densities of the two dynamics evolve in a completely 
different way. 

In the first part of this section we therefore fix the initial fields for the 
Vlasov-Maxwell dynamics from the comparison dynamics; see formula (IC) 
below. From a mathematical viewpoint this procedure comes with the addi- 
tional advantage that results on the existence and uniqueness of local-in-time 
solutions for both the full dynamics and the comparison dynamics have been 
established; see [GS86, Sch86, Bau06a]. This way it is possible in Theorem 
2.4 to obtain a pointwise approximation up to the order O(c~*). It should be 
mentioned that in [Sch86, Deg86, BKO05] the fields are adapted in the same 
way up to the relevant orders. 

There are two drawbacks of this method. In essence post-Newtonian ex- 
pansion is an expansion of the relativistic velocity p and the retarded time 
t—c'|x—y]. It is clear that assuming localized sources the expansion of the 
retarded time is only a good approximation in the near zone of the source 
where |x — y| < c. This is reflected in the fact that the estimates for the 
fields in Theorem 2.4 and Theorem 2.5 are only local in the space variable x. 
Thus also the adapted initial fields are only reliable in the near zone, as is 
underlined by the fact that they have infinite energy. From a more physical 
point of view it is moreover questionable to use the Cauchy problem at all. For 
post-Newtonian expansions the main interest lies in localized systems which 
are isolated from the rest of the world and which have already evolved for a 
long time with small velocities. Therefore the Cauchy problem might not be 
the right formulation since it is not clear how to incorporate these properties 
into the initial fields. 

In physics textbooks isolated systems are characterized by the absence of 
incoming radiation, i.e., there is no energy coming into the system from past 
null infinity; see [Cal04]. Here past null infinity is that region of spacetime 
which is reached along backward light cones. In case that the sources are 
given, fields which are free of incoming radiation are usually calculated by 
means of retarded potentials. In the second part of this section we consider 
a family of solutions of (RVM.), parameterized by c, passing through f° + 
at time t = 0; in fact the initial data may also depend on c according to 
(IC), but this dependence is suppressed in our notation. Then in contrast to 
the Cauchy problem for (RVM,) the electromagnetic fields are just computed 
by means of the retarded potentials; see (re RVM.). The underlying physical 
picture is that in the absence of incoming radiation every solution of (RVM,) 
will approach a solution of (re RVMo), i.e., solutions of (ree RVM_) form a kind 
of initial layer. Since it is our goal to model slow systems we assume that 
the momenta are bounded uniformly in c > 1 and time t € R. It is beyond 
the scope of the present survey paper to investigate the existence of solutions 
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with mathematical rigor. Instead we simply introduce Assumption (A) which 
summarizes all the properties needed. Note however that in [Cal04] the exis- 
tence of such global solutions is proved for small f°* and also uniqueness is 
discussed. 





Viasov-Maxwell dynamics with adapted initial data 


To achieve the improved approximation accuracy we match the initial data of 
(RVM.) by the data for the radiation system. For prescribed initial densities 
fo’ and f:’~ we determine (fg , Eo), Bi, (f3 , £2), and Bs according to what 
has been explained above. Then we consider the Cauchy problem for (RVM,), 
where the initial data are taken as 


f*(0,2,p) = fE (x, p) = fo (x, p) +? fy" (x, p) +O fo ireel® P) 

E(0,2) = E° (x) = Eo(0, £) + c7? E2(0, £) + (2/3)? DEl (0) 
POSTE. pega) 

B(0,£) = B° (x) = c7'By(0,x) + c’? B3(0,2) + c74 B? teelt). 





























(IC) 
In contrast to the contributions at orders 0 to 3, which are fixed by the values 
of the approximations, ( Jeke E? trees Be, tree) can be chosen freely. They are 





only subjected to the constraints V - FÈ fee = 4T S e — fe tree) dp and 
V - B? tee = 0. Note that the constraint equations at the lower orders are 
satisfied by fiat. Furthermore, we shall assume that 





Te free € c™(R° x Rĉ), Ee frees Be tes € Ce (RÌ), 
Fee =0 if |z|>ro or |p| ro, (2.21) 


free 


ISE frecl ges < So, EE reeline + ||- B2, reela s < So, 








holds uniformly in c, with some ro, So > 0 fixed. Before we formulate the 
approximation result, let us recall that solutions of (RVM_) with initial data 
(IC) exist at least on some time interval [0, Î[ which is independent of c > 1; 
see [Sch86, Thm. 1]. 








Proposition 2.3. Assume that fj’ and fy’~ satisfy (2.18). If f° *, E°, and 
B° are defined according to (IC), then there exists 0 < T < œ (independent 
of c) such that for all c > 1 there is a unique smooth solution (f=, E, B) of 
(RVM.) with initial data (IC) on the time interval [0, T|. In addition, for every 
T <T there are constants M3(T), M4(T) > 0 such that for allO0 <t<T, 








f*(t,2,p)=0 if || > M3(T) or |p| > M;(T), 
[f7 (t,x, p)| + |EG, x)| + |B, x)| < Ma(T), 





for every x,p E€ RÌ andc>1. 
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Actually in [Sch86, Thm. 1] E° and B° do not depend on c, but an inspection 
of the proof shows that the assertions remain valid for initial fields as defined 
by (IC). 

The first main approximation result at 1.5 PN is as follows; see [Bau06b]. 
Theorem 2.4. Assume that fj’ and f3'™ satisfy (2.18). Then calculate 
(fo. Eo), Bi, (f3, E2), and B3 by means of (rrVP.), (2.7), (LVPpiasma), 
and (2.9), respectively. Thereafter choose the initial data (f°, E°, B°) for 
(RVM.) according to (IC) and (2.21). Let (f,E,B) denote the solution of 
(RVM_) with initial data (IC) and let (f*", E®, BÈ) be defined by (2.19). 
Then for every T < min{T,T} andr > 0 er are constants M(T) > 0 and 
M(T,r) > 0 such that for allO<t<T, 


I= t, xp) — JE t, x, p)| < M(T)e* (x €R’), 
|E(t,x) — E? (t, £)| < M(T,r)e* (fæl <r), 
|B(t,x) — B? (t, x)| < M(T,r)e* (lel <r), 


























for every p € R? and c> 1. 


The constants M(T) and M(T,r) are independent of c > 1, but they do 
depend on the basic constants ro and So. Note that if (RVM.) is compared 
to the Vlasov-Poisson system (VPpiasma) only, one obtains the Newtonian 
approximation 


|f*(t, x, p) — fo (t,x, p)| + E(t, £) — Eo(t, x)| + |B, x)| < M(L)c™ 

see [Sch86, Thm. 2B]. If it is compared to the Darwin system the estimates 

IŻ 2, p) — f*" PNG, x, p)| + |B, x) — BYPNG,2)| < M(T)e* 
\E(t,2) — BPN(t,2)| < M(T.ne’, 

are found; see [BK05, Thm. 1.1] and recall (2.8). At first glance it could 

seem it is a strong limitation to Theorem 2.4 that the time interval [0,7] c 

(0, min{T, T}[ might be very small, as T and T might be very small. Regarding 

this point we remind the rescaling from Sect. 2.1 which allows to reformulate 


the result in an e-dependent fashion on the time interval (0, ¢~3/?T] as € — 0, 
i.e., for long times. 














The retarded Vlasov-Maxwell dynamics 


Following [Cal04] we introduce the retarded relativistic Vlasov-Maxwell sys- 
tem as 

















Of*+p:-Vef* (E+ ctp x B)-V,f* =0, 


Sea “(Vo + ai) le yl), 
dy 

|z — yl 
a j= [ôE - F) 


(ret RVM_) 


B(t, £) = —— V x j(t—c*|z — yl, y), 
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If we assume that (f+, E, B) is a smooth solution of (;¢¢.RVM_), then p and j 
satisfy the continuity equation 0,0+V-j = 0. Therefore the retarded fields are 
a solution of the Maxwell equations, i.e., (f, Æ, B) also solves (RVM). Note 
that it is necessary to know the densities for all times | — 00, ¢] in order to 
compute the fields at time t. Hence there is no sense to the notation of a local 
solution of this system. As in the case of the Cauchy problem every solution 
of (ret RVM_) satisfies f*(t,x,p) = f+(0, X=(0; t, x, p), P*(0;t, x, p)), where 
s œ (X*(s;t, x, p), P=(s;t,x,p)) solves the characteristic system 





























X=P, P=+(E+c'!P xB), (2.23) 











with initial data X+ (t; t; x, p) = gz and P+ (t; t, x, p) =p. Thus 0 < fet, E) < 
| f*(0, -,-)||,, holds. 

As before let fo’, f’~, and f? 
Put 




















satisfy (2.18) with some ro, So > 0. 


free 




















foe = fom FeO fy +O Se ree: 
We make the following Assumption (A): 





(a) For every c > 1 there is a global solution f+ € C4(R x R? x R°) of 
(ret RVM.) passing through f= at time t = 0, i.e., f+(0,2,p) = f= (x, p) 
for xz, p € R. 

(b) There is a constant P, > 0 such that f=(t, x, p) = 0 for |p| > Pı and c> 1. 
In particular, f®(t, x, p) = 0 for |x| > ro + Pı |t| by (2.23). 

(c) For every T > 0, R > 0, and P > 0 there is a constant M5(T, R, P) > 0 
such that 























[apt fE (t, x, p)| + |OfVaf*(t, £, p)| < Ms(T, R, P) 


for |t| < T, |æ| < R, |p| < P, and |a| < 3, uniformly in c > 1. 





Our second main approximation result at 1.5 PN is taken from [Bau06b]. 





Theorem 2.5. Assume that (f*, E, B) is a family of solutions of (re RVMe) 
satisfying Assumption (A) with constants P, and M5(T, R,P). Take T > 0 
from Proposition 2.1. Then for every T < T and r > 0 there are constants 
M(T) >0 and M(T,r) >0 such that for all 0<t<T, 








|+ (t,x, p) — f**(t,2,p)|< M(T)e* (x ER’), 
E(t, £) — E? (t, x)| < M(T,r)c™* (\e| < r), 
|B(t, £) — BR(t,a)| < M(L,r)e* (æl <r), 
for every p € R? and c > 2P}. The constants M(T) and M(T,r) do only 


depend on ro, So, Pi, and Ms(-,-,-). In particular they are independent of 
Cc a 2P,. 


Radiative Friction for Charges Interacting with the Radiation Field 547 
2.4 Comparison to the particle model 


We shall compare our results for the continuous density Vlasov models to the 
corresponding results for individual particle models. Usually the latter are 
denoted the Abraham-Lorentz system; see [KS00, KS01, Spo04]. Both systems 
are quite similar for the Hamiltonian approximations up to 1 PN. In addition, 
dissipative corrections at 1.5 PN make it necessary to have a closer look at 
the underlying phase space. The true comparison dynamics lives on a center- 
like manifold in an extended phase space. In addition, the dynamics on this 
manifold can be approximated by a modified Vlasov-Poisson system coupled 
to a second order equation, as in (SGPP,,). In [KRO1b, Sect. 3] it is argued that 
the force term of the 1.5 PN approximation used in that paper can be obtained 
formally in the limit “number of particles — oo” from the individual particle 
models. In comparison to [KS00, KS01] the main difference to the present 
paper lies in the treatment of the initial data. For the individual particle model 
the initial data for the fields are supposed to be of “charged soliton” type. 
One can think of these fields as generated by charges forced to move freely 
for —co < t < 0 with their initial velocities. For the approximation this leads 
to an initial time slip tọ which the charges need to “forget” their initial data. 
The initial data for the approximation are then fixed by matching the data 
of the full system at time to. Therefore the initial data for the approximation 
are given only implicitly, since first one has to compute a solution of the full 
system over [0,to]. Regarding the Cauchy problem for the Vlasov-Maxwell 
system, the matching is done the other way round. For a given initial density 
one computes the fields of the approximations and imposes their values at 
t = 0 as initial data for the fields of the full system. Hence these initial data 
are given more explicitly. In fact it is possible to calculate them by fixing only 
fo’ and f3’~; see (2.20a) and (2.20b). Also Theorem 2.4 and Theorem 2.5 
seem to be stronger than the results obtained for the particle model, as the 
passage from 1 PN to 1.5 PN did improve the approximation only in certain 
directions [KSO1, (3.21), (3.32)]. It seems reasonable to expect that a matching 
of the initial data at t = 0 as described above could also improve the earlier 
results on the Abraham-Lorentz system. 








3 The Vlasov-Nordstrom system 


Recently there has been some interest in a simplified but still relativistic 
model of gravitation in which Vlasov matter is coupled to a scalar theory of 
gravitation; the latter essentially goes back to Nordström [Nor13]. The metric 
tensor used in General Relativity is replaced by a scalar function and the 
Einstein equations are replaced by a wave equation. In [CR03] the following 
system (VN,) has been considered. 


S(f)— (Slop + 12V a) -Vpf =48(y)f, -0p +? Ary =4np, 
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where u = fyf dp. Once again f = f(t, x,p) denotes the density to find a 
particle at time t at position x with momentum p, where t € R, x € RÌ, and 
p € R®. The scalar gravitational potential p = y(t,x) is generated by the 
particles via the source u, and c denotes the speed of light. In addition, 


pP =l, y=(1+e°p?) 17, paryp, and S= ô +P Ve. 
The initial data are 


f(0, x, p) = F(x, p), y(0, x) T p’ (x), Or(0, x) = p(x). 


For a physical interpretation and a derivation of this system see [CL04]. In 
this formulation (VN,) exhibits many similarities to the relativistic Vlasov- 
Maxwell system. Thus it is not surprising that many techniques developed 
for the Vlasov-Maxwell system also apply to the Vlasov-Nordström system. 
However, regarding basic questions like existence and uniqueness of solutions 
(VN,) is by far better understood than (RVM,). Global existence of classical 
solutions for unrestricted data is proved in [Cal06], also see [Lee05] for the 2D 
case. 

Again we are concerned with the non-relativistic limit c — oo of (VN,). 
Under certain circumstances it has been made rigorous in [Ren94] that the 
(gravitational) Vlasov-Poisson system is the non-relativistic limit of the full 
Einstein-Vlasov system. In [CL04] it has been shown that as c — oo also 
solutions of (VN,) converge to a solution of a Vlasov-Poisson system with an 
error of the order O(c~'). These facts support the belief that (VN.) may serve 
well as a model problem for Einstein-Vlasov. 

To derive the higher order (post-Newtonian) approximations we follow the 
naive approach from Sect. 2.1 and first expand all relevant quantities in powers 
of cot, 


f=fotetfte "ftw... 
p = po + ctu +e? p... 
Y = Yo cly c 9 Prata 














Comparing coefficients yields the equations — Aryo = 0 and —Az91 = 0. 
Thus we set Yo = y1 = 0. As mentioned above, at order zero the gravitational 
Vlasov-Poisson system (VP grav) 


ROEE E E wo = f fodo, 
AEF J tates. Onr, 


is obtained; see [CL04] for a proof including the necessary error estimates. At 
the first order the linearized Vlasov-Poisson system 
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Of: +p: Veli -— Vevps: Vp fo — Va v2: Vo fir = 9, 
m= | fidp, Ags = 4ru + OF ¢1, 


appears. Hence if we suppose that f1(0,2,p) = 0, then we can set fı = 0 
and y3 = 0, which also yields 4; = 0. For the second order one derives an 
inhomogeneous Vlasov equation coupled to a Poisson equation (LVP grav), 


On f2 +p £ Vale bza Ve p2: Vp f2 = Va Vp fo 
= 4fo5(y2) + (P?/2) p- Vafo + (S(¢2)p - (9?/2)Vave2) Vofo, 


u2 = fie — (p°/2)fo)dp, Anya = 4rp2 + OF 2, 


where Õ = & + p- Vz. For (LVPerav) we choose homogeneous initial data 
f2(0, x, p) = 0. 

We now follow the route of adapted initial data as above. Similarly as 
before it would also be possible to approximate solutions to the retarded 
Vlasov-Nordstrém system, the latter being defined analogously to (ret RVM-). 
For the retarded Vlasov-Nordstrom system the leading order radiation contri- 
bution can be determined explicitly. It is due to monopole radiation and does 
not vanish for spherically symmetric solutions [BL*06, ST93). 

For the adapted initial data let f° be given. Then we calculate (fo, y2) 
and (f2,y4) according to (VPgrav) and (LVPgray). Now we consider (VN,) 
with initial data 


f(0,2,p) = f°(x,p), 
p(x) = y(0, x) = c~*y2(0, x) + c*y4(0, x) ag i Ones (B): 
y" (zx) ag sp(0, x) T c 7A p2(0, x) F c *0,y4(0, x) ia C Pe ala) 


where 9? .., Yiee E€ CE(R?). The following approximation theorem from 
[Bau05] shows that the Darwin approximation 


f= foto "fe, gP =e pote ty, 


yields a higher order pointwise approximation of the Vlasov-Nordstrém (VN,). 
Before we formulate this result let us recall that solutions of (VN.) with 
matched initial data do exist at least on some time interval [0,7] which is 
independent of c > 1; see [CL04, Thm. 3]. 


Theorem 3.1. Assume that f° € C®(R? x R?) is nonnegative and compactly 
supported. From f° calculate (fo, p2) and (f2,y~4). Thereafter introduce the 
matched initial data for (VN.) as above. Let (f, p) denote the corresponding 
solution of (VN.) and let (f?,p?) be defined as the Darwin approximation. 
Then there is a constant M(T) > 0, and for every r > 0 one can select 
M(T,r) > 0, such that for all0 <t<T, 
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S(t, x, p)— fP (t,£,p)| < M(T)c* (x ER’), 
lolt, £) — p? (t, x)| < M(T,r)c* (lel <1), 
lAv(t, £) — D (t,)|<M(T)c* (æ €R°), 
|Vep(t, £) — Vep?” (t, x)| < M(T, r)? (\z| <r), 


for every p € R? and c > 1. The constants M (T) and M(T,r) are independent 
ofc>1. 
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Summary. Numerical methods for oscillatory, multi-scale Hamiltonian systems are 
reviewed. The construction principles are described, and the algorithmic and ana- 
lytical distinction between problems with nearly constant high frequencies and with 
time- or state-dependent frequencies is emphasized. Trigonometric integrators for 
the first case and adiabatic integrators for the second case are discussed in more 
detail. 


1 Introduction 


Hamiltonian systems with oscillatory solution behaviour are ubiquitous in 
classical and quantum mechanics. Molecular dynamics, in particular, has mo- 
tivated many of the new numerical developments in oscillatory Hamiltonian 
systems in the last decade, though the potential range of their applications 
goes much farther into oscillatory multi-scale problems of physics and engi- 
neering. 

Since the publication of the last review article on the numerical solution of 
oscillatory differential equations by Petzold, Jay & Yen [PJY97] in 1997, algo- 
rithms and their theoretical understanding have developed substantially. This 
fact, together with the pleasure of presenting a final report after six years of 





Fig. 1.1. Oscillations and long time steps 


554 D. Cohen, T. Jahnke, K. Lorenz, Ch. Lubich 


funding by the DFG Priority Research Program 1095 on multiscale systems, 
have incited us to write the present review, which concentrates on Hamiltonian 
systems. A considerably more detailed (and therefore much longer) account 
than given here, appears in the second edition of the book by Hairer, Lubich 
& Wanner [HLW06, pp. 471-565]. Numerical methods for oscillatory Hamil- 
tonian systems are also treated in the book by Leimkuhler & Reich [LR04, 
pp. 257-286], with a different bias from ours. 

The outline of this review is as follows. Sect. 2 describes some classes of os- 
cillatory, multi-scale Hamiltonian systems, with the basic distinction between 
problems with nearly constant and with varying high frequencies. Sect. 3 
shows the building blocks with which integrators for oscillatory systems have 
been constructed. As is illustrated in Fig. 1.1, the aim is to have methods that 
can take large step sizes, evaluating computationally expensive parts of the 
system more rarely than a standard numerical integrator which would resolve 
the oscillations with many small time steps per quasi-period. Sect. 4 deals 
with trigonometric integrators suited for problems with almost-constant high 
frequencies, and Sect. 5 with adiabatic integrators for problems with time- or 
solution-dependent frequencies. 


2 Highly oscillatory Hamiltonian systems 


We describe some problem classes, given in each case by a Hamiltonian func- 
tion H depending on positions q and momenta p (and possibly on time t). 
The canonical equations of motion p = —V,H, ¢ = VpH are to be integrated 
numerically. 


2.1 Nearly constant high frequencies 


The simplest example is, of course, the harmonic oscillator given by the Hamil- 
tonian function H(p,q) = p° + 4w?q?, with the second-order equation of 
motion ¢ = —w?q. This is trivially solved exactly, a fact that can be exploited 
for constructing methods for problems with Hamiltonian 


H(p, q) = ÍT Mp + 5a" Ag+ U(q) (2.1) 
with a positive semi-definite constant stiffness matrix A of large norm, with a 
positive definite constant mass matrix M (subsequently taken as the identity 
matrix for convenience), and with a smooth potential U having moderately 
bounded derivatives. 

The chain of particles illustrated in Fig. 2.1 with equal harmonic stiff 
springs is an example of a system with a single high frequency 1/e. With the 
mid-points and elongations of the stiff springs as position coordinates, we have 
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stiff | soft 
harmonic nonlinear 


Fig. 2.1. Chain with alternating soft nonlinear and stiff linear springs 


1 /0 0 
A=3(5 J O<e<l. (2.2) 


Other systems have several high frequencies as in 
1 
A= = diag(0,01,...,Wm), 0<e<l, (2.3) 
E 


with 1 < w1 < -++ < Wm, or a wide range of low to high frequencies without 
gap as in spatial discretizations of semilinear wave equations. 

In order to have near-constant high frequencies, the mass matrix need not 
necessarily be constant. Various applications lead to Hamiltonians of the form 
studied by Cohen [Coh06] (with partitions p = (po, pı) and q = (qo, q1)) 


1 a E 1 
H(p, q) = 5Po Mola) ‘pot 5P1 M; ‘pit5p R@)pt+ sau Ain +U(q) (2.4) 


with a symmetric positive definite matrix Mo(q), constant symmetric positive 
definite matrices Mı and A;, a symmetric matrix R(q) with R(qo,0) = 0, 
and a potential U (q). All the functions are assumed to depend smoothly on q. 
Bounded energy then requires qı = O(c), so that pT R(q)p = O(c), but the 
derivative of this term with respect to qı is O(1). A simple example of (2.4) 
is given by a triatomic (water) molecule as illustrated in Fig. 2.2, with strong 
linear forces that approximately keep the distances and the angle fixed. 


2.2 Explicitly time-dependent high frequencies 


Here the prototype model is the harmonic oscillator with time-dependent 
frequency, H (p,q, t) = ip tie? w(t)? with w(t) and w(t) of magnitude ~ 1 
and €e < 1. Solutions of the equation of motion g = —e~?w(t)?q oscillate with a 
quasi-period ~ £, but the frequencies change on the slower time scale ~ 1. The 
action (energy divided by frequency) I(t) = H (p(t), ¢(t))/w(t) is an almost- 
conserved quantity, called an adiabatic invariant; see, e.g., Henrard [Hen93]. 


Fig. 2.2. Triatomic molecule 
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Numerical methods designed for problems with nearly constant frequencies 
(and, more importantly, nearly constant eigenspaces) behave poorly on this 
problem, or on its higher-dimensional extension 


H(p,4,t) = 5p"M(t)"'p + saa" Aa + Ula), (2.8) 
which describes oscillations in a mechanical system undergoing a slow driven 
motion. Here M(t) is a positive definite mass matrix, A(t) is a positive semi- 
definite stiffness matrix, and U (q, t) is a potential, all of which are assumed to 
be smooth with derivatives bounded independently of the small parameter e. 
This problem again has adiabatic invariants associated with each of its high 
frequencies as long as the frequencies remain separated. However, on small 
time intervals where eigenvalues almost cross, rapid non-adiabatic transitions 
may occur, leading to further numerical challenges. 


2.3 State-dependent high frequencies 


Similar difficulties are present, and related numerical approaches have recently 
been developed, in problems where the high frequencies depend on the posi- 
tion, as in the problem class studied analytically by Rubin & Ungar [RU57], 
Takens [Tak80], and Bornemann [Bor98]: 


1 _ 1 
(p,q) = 5p" M(@) ‘p+ 3V (4) + Ula), (2.6) 
with a constraining potential V(q) that takes its minimum on a manifold 
and grows quadratically in non-tangential directions, thus penalizing motions 
away from the manifold. In appropriate coordinates we have 


wd 


501 A(qo)a for q= (qo,%1) 


V(q) 
with a positive definite matrix A(qo). 

A multiple spring pendulum with stiff springs as illustrated in Fig. 2.3 is a 
simple example, with angles as slow variables go and elongations of stiff springs 
as fast variables qı. In contrast to the triatomic molecule of Fig. 2.2, where 
also the angle is kept approximately constant, here the frequencies of the high 


Fig. 2.3. Triple pendulum with stiff springs 
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oscillations depend on the angles which change during the motion. Different 
phenomena occur, and different numerical approaches are appropriate for the 
two different situations. 

As in the case of time-dependent frequencies, difficulties (numerical and 
analytical) arise when eigenfrequencies cross or come close, which here can lead 
to an indeterminacy of the slow motion in the limit € — 0 (Takens chaos). 


2.4 Almost-adiabatic quantum dynamics and 
mixed quantum-classical molecular dynamics 


A variety of new developments in the numerics of oscillatory problems within 
the last decade were spurred by problems from quantum dynamics; see, 
e.g., [BN*96, DS03, FL06, HL99b, HL99c, HL03, Jah03, Jah04, JL03, LT05, 
NR99, NS99, Rei99]. Though these problems can formally be viewed as be- 
longing to the classes treated above, it is worthwhile to state them separately: 
time-dependent quantum dynamics close to the adiabatic limit is described 
by an equation 


ie = HY (2.7) 


with a finite-dimensional hermitian matrix H(t) with derivatives of magnitude 
~ 1 representing the quantum Hamiltonian. This is a complex Hamiltonian 
system with the time-dependent Hamiltonian function su (t) (consider 
the real and imaginary parts of ọ as conjugate variables, and take an £71- 
scaled canonical bracket). 

A widely used (though disputable) model of mixed quantum-classical me- 


chanics is the Ehrenfest model 


ü= -Va (WH (a) 


: (2.8) 
iey = H(q)b 

with a hermitian matrix H (q) depending on the classical positions q. This cor- 
responds to the Hamiltonian function sp" p+ sy (q)W. The small parameter 
E€ here corresponds to the square root of the mass ratio of light (quantum) and 
heavy (classical) particles. While this is indeed small for electrons and nuclei, 
it is less so for protons and heavy nuclei. In the latter case an adiabatic reduc- 
tion to just a few eigenstates is not reasonable, and then one has to deal with a 
quantum Hamiltonian which is a discretization of a Laplacian plus a potential 
operator that depends on the classical position. Both cases show oscillatory 
behaviour, but the appropriate numerical treatment is more closely related 
to that in Sects. 2.3 and 2.1 in the first and second case, respectively. Irre- 
spective of its actual physical modeling qualities, the Ehrenfest model is an 
excellent model problem for studying numerical approaches and phenomena 
for nonlinearly coupled slow and fast, oscillatory motion. 
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3 Building-blocks of long-time-step methods: 
averaging, splitting, linearizing, corotating 


We are interested in numerical methods that can attain good accuracy with 
step sizes whose product with the highest frequency in the system need not be 
small; see Fig. 1.1. A large variety of numerical methods to that purpose has 
been proposed in the last decade, and a smaller variety among them has also 
been carefully analysed. All these long-time-step and multiscale methods are 
essentially based on a handful of construction principles, combined in different 
ways. In addition to those described in the following, time-symmetry of the 
method has proven to be extremely useful, whereas symplecticity appears to 
play no essential role in long-time-step methods. 


3.1 Averages 


A basic principle underlying all long-time-step methods for oscillatory differ- 
ential equations is the requirement to avoid isolated pointwise evaluations of 
oscillatory functions, but instead to rely on averaged quantities. 

Following [HLW06, Sect. VIII.4], we illustrate this for a method for second- 
order differential equations such as those appearing in the previous section, 


G=f@, f(g = Ea) + feo. (3.1) 


The classical St6rmer-Verlet method with step size h uses a pointwise evalu- 
ation of f, 
Qnti — 24n + qn-1 = h? Flan), (3.2) 


whereas the exact solution satisfies 





AE ET E =# | a- 0I) flat +0h)) d0. (3.3) 


The integral on the right-hand side represents a weighted average of the force 
along the solution, which will now be approximated. At t = tn, we replace 


FCn + Oh) = SE an) + FEI (u(Ah)) 
where u(r) is a solution of the differential equation 
ü = FENG) + fP (u) . (3.4) 
We then have 
1 
rf (1— 101) (E (qn) + fH (u(0h)) ) dB = u(h) — 2u(0) +u(—=h) . (3-5) 
-1 


For the differential equation (3.4) we assume the initial values u(0) = qn and 
u(0) = ån or simply ù(0) = 0. This initial value problem is solved numeri- 
cally, e.g., by the St6rmer-Verlet method with a micro-step size +h/N with 
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N > 1on the interval [—h, h], yielding numerical approximations u™ (+h) and 
wN (+h) to u(+h) and u(+h), respectively. No further evaluations of fl! 
are needed for the computation of u (+h) and ù (+h). This finally gives the 


symmetric two-step method of Hochbruck & Lubich [HL99al, 




















qn+1 — 24n + Qn-1 = u™ (h) a 2u™ (0) +: u (=h) : (3.6) 


The method can also be given a one-step formulation, see [HLW06, Sect. VIII.4]. 
Further symmetric schemes using averaged forces were studied by Hochbruck 
& Lubich [HL99c] and Leimkuhler & Reich [LR01]. 

The above method is efficient if solving the fast equation (3.4) over the 
whole interval [—h, h] is computationally less expensive than evaluating the 
slow force flow], Otherwise, to reduce the number of function evaluations we 
can replace the average in (3.5) by an average with smaller support, 


ô 
Insia +ani =H? f KO (San) + I (u(oh)) Jad (3:7) 


—ô 


with ô < 1 and an averaging kernel K (0) with integral equal to 1. This is fur- 

ther approximated by a quadrature sum involving the values ffest] (uN (mh/N )) 
with |m| < M and 1 « M « N. The resulting method is an example of a 

heterogeneous multiscale method as proposed by E [E03] and Engquist & Tsai 

[ET05], with macro-step h and micro-step h/N. Method (3.7) is in between 

the St6rmer-Verlet method (3.2) (6 = 0) and the averaged-force method (3.6) 

(6 = 1). 

In the above methods, the slow force is evaluated, somewhat arbitrarily, 
at the particular value qn approximating the oscillatory solution q(t). Instead, 
one might evaluate f!!°“l at an averaged position Tn, defined by solving ap- 
proximately an approximate equation 


6 
ü= f(u), u(0)=qn, u(0) =0, and setting 7, = I K (0) u(Oh) dd, 
= 


with another averaging kernel K (@) having integral 1. Such an approach 
was first studied by Garcfa-Archilla, Sanz-Serna & Skeel [GSS99] for the im- 
pulse method (see below), and subsequently in [HL99a] for the averaged-force 
method, in order to reduce the sensitivity to step size resonances in the nu- 
merical solution. For that purpose, it turned out that taking 6 = 1 (or an 
integer) is essential. 


3.2 Splitting 
The Stérmer-Verlet method (see [HLW03]) can be interpreted as approximat- 


ing the flow y# of the system with Hamiltonian H(p, q) = T(p) + V(q) with 
T(p) = sp" p by the symmetric splitting 
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V T V 
Phr/2° Ph ° Ph/2: 


In the situation of a potential V = V'fast] + yislow] we may instead use a 
different splitting of H = (T + Vl®stl) + Vis!) and approximate the flow pi 


of the system by 


y [slow] T+V [fast] y [slow] 
Phj2 OPh ° Ph/2 


This is the impulse method that was proposed in the context of molecular 
dynamics by Grubmüller, Heller, Windemuth & Schulten [GH*91] and Tuck- 
erman, Berne & Martyna [TBM92]: 


1. kick: set Pr =Pn- sh VVislow] (dn) 


2. oscillate: solve į = —VV st] (q) with initial values (qn, p} ) 
over a time step h to obtain (qn+1, Pn+1) 


3. kick: set Pn41 = Ppa — shV Vow] (dn41) - 


(3.8) 


Step 2 must in general be computed approximately by a numerical integrator 
with a smaller time step. If the inner integrator is symplectic and symmetric, 
as it would be for the Stormer-Verlet method, then also the overall method is 
symplectic and symmetric. 

Garcfa-Archilla, Sanz-Serna & Skeel [GSS99] mollify the impulse method 
by replacing the slow potential V'!°“l(q) by a modified potential Vow] (q), 
where g represents a local average as considered above. 


3.3 Variation of constants formula 
A particular situation arises when the fast forces are linear, as in 


Gg = —Ax + g(q) (3.9) 


with a symmetric positive semi-definite matrix A of large norm. With Q = 
A12, the exact solution satisfies 


Ce an as 
ğ Í (aa Ta g(q(s)) ds . 


Discretizing the integral in different ways gives rise to various numerical 
schemes proposed in the literature for treating (3.9) (the earliest references 
are Hersch [Her58] and Gautschi [Gau61]). This also gives reinterpretations 
of the methods discussed above when they are applied to (3.9). We consider a 
class of trigonometric integrators that reduces to the St6rmer-Verlet method 
for A = 0 and gives the exact solution for g = 0 [HLW06, Chap. XIMI]: 


Integrators for Highly Oscillatory Hamiltonian Systems 561 
1 

In41 = COShR qn + QT sin hN Gn + ad 9(fan) (3.11) 
1 

n41 = —QAsinhD qn + cos AN ån + zL g(Ban) + Yı g(Pan+1)) - (3-12) 


Here W = yY(hN) and = (hN), where the filter functions p and ¢ are 
smooth, bounded, real-valued functions with w(0) = (0) = 1. Moreover, 
we have Yo = Wo(h2), YW = Yyı(hN) with even functions Wo, Yı satisfying 
wo(0) = yı (0) = 1. The method is symmetric if and only if 


WE) = sinc(£) Yı (£), — Yo(E) = cos(€) Yı (£) , (3.13) 


where sinc(€) = sin(&)/€. In addition, the method is symplectic (for g = 
—VU) if and only if 
P(E) = sinc(£) (8) . (3.14) 


The two-step form of the method reads 
dn+1 — 2 cos(h2) qn + dn—1 = h?Wg(fan) . (3.15) 


Various methods of Sects. 3.1 and 3.2 can be written in this way, with different 
filters Y and ®&, when they are applied to (3.9): 


w(€)=sine($é) o(6)=1 Gautschi [Gau61] and averaged method (3.6) 
w(é)=since(E) g(€)=1 Deuflhard [Deu79] and impulse method (3.8) 
w(€)=sinc?(€) o(€)=sinc(€) Garcfa-Archilla & al. [GSS99]: mollified i.m. 
w(é)=sine?(€) o(€)=1 Hairer & Lubich [HL00] 

w(é)=sine®(€) o(€)=sinc(€) Grimm & Hochbruck [GH06] 


As will be seen in Sect. 4, the choice of the filter functions has a substantial 
influence on the long-time properties of the method. 


3.4 Transformation to corotating variables 


For problems where the high frequencies and the corresponding eigenspaces 
depend on time or on the solution, as in (2.5)—(2.8), it is useful to transform 
to corotating variables in the numerical treatment. 

We illustrate the basic procedure for Schrédinger-type equations (2.7) with 
a time-dependent real symmetric matrix H(t) changing on a time scale ~ 1, for 
which the solutions are oscillatory with almost-period ~ £. A time-dependent 
linear transformation 7(t) = T-(t)W(t) takes the system to the form 


A(t) = S(t) n(t) with Se= ÈT! — alias (3.16) 


A first approach is to freeze H(t) ~ H., over a time step and to choose the 
transformation 
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Fig. 3.1. Oscillatory solution component and adiabatic variable as functions of time 


at 
T.(t) = exp( =H.) 

yielding a matrix function $,(t) that is highly oscillatory and bounded in norm 
by O(h/e) for |t — to| < h, if H, = H(to + h/2). Numerical integrators using 
this transformation together with an appropriate treatment of the oscillatory 
integrals, are studied by Hochbruck & Lubich [HL99cl, Iserles [Ise02, Ise04], 
and Degani & Schiff [DS03]. Step sizes are still restricted by h = O(e) in 
general, but can be chosen larger in the special case when the derivatives of 
+H(t) are moderately bounded. 

A uniformly bounded matrix S,(t) in (3.16) is obtained if we diagonalize 


H(t) = QH) A(t) Q(t)” 


with a real diagonal matrix A(t) = diag (A, (t)) and an orthogonal matrix Q(t) 
of eigenvectors depending smoothly on t (possibly except where eigenvalues 
cross). We define 7(t) by the unitary adiabatic transformation 


. t 

nO) = exp(=8(0) QIU) with Al) = diag (Hl) = f ACs) as, 

(3.17) 

which represents the solution in a rotating frame of eigenvectors. Such trans- 
formations have been in use in quantum mechanics since the work of Born & 
Fock [BF 28] on the adiabatic invariants I;(t) = |n;(t)|? in Schrödinger equa- 
tions. Figure 3.1 illustrates the effect of this transformation, showing solution 
components in the original and in the adiabatic variables. 

The transformation (3.17) to adiabatic variables yields a differential equa- 
tion where the ¢-independent skew-symmetric matrix 


W(t) = Q(t) Q(t) 
is framed by oscillatory diagonal matrices: 
OE exp(=o()) W(t) exp(-+9()) n(t). (3.18) 


Numerical integrators for (2.7) based on the transformation to the differential 
equation (3.18) are given by Jahnke & Lubich [JL03] and Jahnke [Jah04]. The 
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simplest of these methods freezes the slow variables n(t) and W (t) at the mid- 
point of the time step, makes a piecewise linear approximation to the phase 
(t), and then integrates the resulting system exactly over the time step. This 
gives the following adiabatic integrator: 


1 : 
Tn+1 = M + hB(tn+41/2) z +n+1) with (3.19) 


B(t) = (e(t (t) = dx(t))) sine( ŽO) — dx(4))) wat) 


j,k 


More involved — and substantially more accurate — methods use a Neumann 
or Magnus expansion in (3.18) and a quadratic phase approximation. Numer- 
ical challenges arise near avoided crossings of eigenvalues, where 7(t) remains 
no longer nearly constant and a careful choice of step size selection strat- 
egy is needed in order to follow the non-adiabatic transitions; see [JLO3] and 
[HLW06, Chap. XIV]. 

The extension of this approach to (2.5), (2.6), and (2.8) is discussed in 
Sect. 5. The transformation to adiabatic variables is also a useful theoretical 
tool for analysing the error behaviour of multiple time-stepping methods ap- 
plied to these problems in the original coordinates, such as the impulse and 
mollified impulse methods considered in Sect. 3.2; see [HLW06, Chap. XIV]. 


4 Trigonometric integrators for problems with nearly 
constant frequencies 


A good understanding of the behaviour of numerical long-time-step methods 
over several time scales has been gained for Hamiltonian systems with almost- 
constant high frequencies as considered in Sect. 2.1. We here review results 
for single-frequency systems (2.1) with (2.2) (and M = J) from Hairer & 
Lubich [HL00] and [HLW06, Chap. XIII], with the particle chain of Fig. 2.1 
serving as a concrete example. The variables are split as q = (qo, q1) according 
to the blocks in (2.2). We consider initial conditions for which the total energy 
H (p,q) is bounded independently of €, 


H(p(0), q(0)) < Const. 


The principal theoretical tool is a modulated Fourier expansion of both the 
exact and the numerical solution, 


g(t) = Ep 2*(t) i", (4.1) 


an asymptotic multiscale expansion with coefficient functions z*(t) changing 
on the slow time scale 1, which multiply exponentials that oscillate with fre- 
quency 1/e. The system determining the coefficient functions turns out to 


564 D. Cohen, T. Jahnke, K. Lorenz, Ch. Lubich 


have a Hamilton-type structure with formal invariants close to the total and 
oscillatory energies. 

The results on the behaviour of trigonometric integrators (3.15) on differ- 
ent time scales have been extended from single- to multi-frequency systems 
(possibly with resonant frequencies) by Cohen, Hairer & Lubich [CHL05], and 
to systems (2.4) with non-constant mass matrix by Cohen [Coh04, Coh06]. 


4.1 Time scale € 


On this time scale the system (2.1) with (2.2) only shows near-harmonic oscil- 
lations with frequency 1/¢ and amplitude O(e) in the fast variables qı, which 
are well reproduced by just any numerical integrator. 


4.2 Time scale £? 


This is the time scale of motion of the slow variables go under the influence 
of the potential U(q). Here it is of interest to have an error in the numerical 
methods which is small in the step size h and uniform in the product of the 
step size with the high frequency 1/e. The availability of such uniform error 
bounds depends on the behaviour of the filter functions 7 and ¢ in (3.15) at 
integral multiples of 7. Under the conditions 


(Qk) = y'(2kr) =0, o((2k—1)t)=0, G(2kr)=0 (4.2) 


for k = 1,2,3,..., it is shown in [HLW06, Chap. XIII.4] that the error after 
n time steps is bounded by 


lan — q(nh)|| < Ch?, |lGn —G(nh)|| < Ch for nh< Const., (4.3) 


with C independent of h/c and of bounds of derivatives of the highly oscilla- 
tory solution. 

Error bounds without restriction of the product of the step size with the 
frequencies are given for general positive semi-definite matrices A in (2.1) by 
Garcia-Archilla, Sanz-Serna & Skeel [GSS99] for the mollified impulse method 
(W(€) = sinc?(€), o(€) = sinc(€)), by Hochbruck & Lubich [HL99a] and 
Grimm [Gri05a] for Gautschi-type methods (7)(€) = sinc?(€/2) and suitable 
@), and most recently by Grimm & Hochbruck [GH06] for general A and 
general classes of filter functions Y and @. 


4.3 Time scale e71 

An energy exchange between the stiff springs in the particle chain takes place 
on the slower time scale e~'. To describe this in mathematical terms, let q1,j 
be the jth component of the fast position variables qı, and consider 


isy 


= 2 
= z di, j + Dee dij» 
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which in the example represents the harmonic energy in the jth stiff spring. 
The quantities J; change on the time scale e71. To leading order in €, their 
change is described by a differential equation that determines the coefficient 
of e/* in the modulated Fourier expansion (4.1). It turns out that for a 
trigonometric method (3.15), the differential equation for the corresponding 
coefficient in the modulated Fourier expansion of the numerical solution is 
consistent with that of the exact solution for all step sizes if and only if 


YCE) pE) = sinc(€) forall € 2 0. 


It is interesting to note that this condition for correct numerical energy ex- 
change is in contradiction with the condition (3.14) of symplecticity of the 
method, with the only exception of the impulse method, given by w = 
sinc, o = 1. That method, however, does not satisfy (4.2) and is in fact 
extremely sensitive to near-resonances between frequency and step size (h/e 
near even multiples of 7). A way out of these difficulties is to consider trigono- 
metric methods with more than one force evaluation per time step, as is shown 
in [HLW06, Chap. XII]. 

In Fig. 4.1 we show the energy exchange of three numerical methods for 
the particle chain of Fig. 2.1, with € = 0.02 and with potential and initial 
data as in [HLW06, p. 22]. At t = 0 only the first stiff spring is elongated, 
the other two being at rest position. The harmonic energies J1, I2, I3, their 
sum I = I, + Ig + Is, and the total energy H (actually H — 0.8 for graphical 
reasons) are plotted along the numerical solutions of the following methods, 
with step sizes h = 0.015 and h = 0.03: 


(A) impulse method (7 = sinc, ¢ = 1) 
(B) mollified impulse method (7 = sinc”, ¢ = sinc) 
(C) heterogeneous multiscale method (3.7) with 6 = Vé. 
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Fig. 4.1. Energy exchange between the stiff springs for methods (A)-(C), with 
h = 0.015 (upper) and h = 0.03 (lower), for e = 0.02 
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For this problem with linear fast forces, the averaging integrals of Sect. 3.1 
and the solution of (3.8) are computed exactly. We notice that for the larger 
step size only method (A) reproduces the energy exchange in a correct way. 
Method (C) behaves very similarly to the Stérmer-Verlet method. 


4.4 Time scales e~N with N > 2 


In Fig. 4.1 it is seen that the total oscillatory energy J remains approximately 
conserved over long times. Along the exact solution of the problem, T is in 
fact conserved up to O(<) over exponentially long times t < e°/*; see Benettin, 
Galgani & Giorgilli [BGG87] and Cohen, Hairer & Lubich [CHL03] for differ- 
ent proofs based on canonical transformations of Hamiltonian perturbation 
theory and modulated Fourier expansions (4.1), respectively. Along the nu- 
merical solution by a trigonometric integrator (3.15), near-conservation of the 
oscillatory energy I and the total energy H are shown in [HL00] and [HLW06, 
Chap. XIII] over times t < e~% under a non-resonance condition between the 
frequency and the step-size: 


sin(=)| >cVh_ for k=1,...,N. 
E 





It is known from [HL00] that the condition 


YE) = sine” (£) o(€) 


is necessary to have long-time conservation of the total energy uniformly for 
all values of h/e, and numerical experiments indicate that this condition on 
the filter functions may also be sufficient. Otherwise, energy conservation is 
lost at least when h/e is close to an even multiple of r. 

Figure 4.2 shows, for the same data and methods as before, the maximum 





























deviations of H and I on the interval [0, 1000] as functions of h/e. 
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Fig. 4.2. Maximum deviation of total energy (upper) and oscillatory energy (lower) 
as functions of h/e (for step size h = 0.02) 
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5 Adiabatic integrators for problems with varying 
frequencies 


Adiabatic integrators are a novel class of numerical integrators that have been 
devised in [JL03, Jah04, Jah03, LJLO5, Lor06, HLW06] for various kinds of 
oscillatory problems with time- or solution-dependent high frequencies, in- 
cluding (2.5)-(2.8). These integrators have in common that the oscillatory 
part of the problem is transformed to adiabatic variables (cf. Sect. 3.4) and 
the arising oscillatory integrals are computed analytically or approximated by 
an appropriate expansion. The methods allow to integrate these oscillatory 
differential equations with large time steps in the adiabatic regime of well- 
separated frequencies and follow non-adiabatic transitions with adaptively 
refined step sizes. 


5.1 Adiabatic integrators for quantum-classical molecular 
dynamics 


Following Jahnke [Jah03], we sketch how to construct a symmetric long-time- 
step method for problem (2.8), which couples in a nonlinear way slow motion 
and fast oscillations with frequencies depending on the slow variables. 

Proceeding as in Sect. 3.4, the quantum system is transformed to adiabatic 
variables by 


ult) = exp (490) (ato) "WO, (5.1) 


where H(q) = Q(q)A(q)Q(q)" is a smooth eigendecomposition of the Hamil- 
tonian and 


a) = | A(q(s)) ds, & = diag(¢;) (5.2) 


to 


is the phase matrix, a diagonal matrix containing the time integrals over the 
eigenvalues A,(q(t)) along the classical trajectory. Inserting (5.1) into (2.8) 
yields the new equations of motion 


ü = —n* exp (ż2) K (q) exp (+2) n, (5.3) 
7) = exp | z2 | W(q, 4) exp | -z8 | n (5.4) 
with the tensor K (q) and skew-symmetric matrix W (q, ġ) given as 
K (4) = Q(4)" VH (a) Q(q); 
T 


waa = (Z0) eW = (wea 65) 
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Equations (5.3) and (5.4) can be restated as 


ï = -n (E@) + K(q))n, (5.6) 
ù = (E(®) e Wa.) (5.7) 


where e means entrywise multiplication, and E(®) denotes the matrix 


E(@) = (cin(®)) ,  €jk(®) = exp (Le = d) . (5.8) 


The classical equation can be integrated using the averaging technique from 
Sect. 3. We insert (5.6) into (3.3) and, in order to approximate the integral, 
keep the smooth variables ņn(t) and K(q(t)) fixed at the midpoint tn of the 
interval [tn—-1,tn+1]. Since more care is necessary for the oscillating exponen- 
tials, we replace ® by the linear approximation 


D(tn + Oh) = B(tn) + OhA(qltn))- (5.9) 


These modifications yield the averaged Stérmer-Verlet method of [Jah03}: 


n41 — 24n + dna = -h?m (E(@n) *Z(dn)® K (dn) tm (5-10) 


1 
wit Ee = / (1 —|6|) E(@hA(gn)) d8. 

-1 

The matrix of oscillatory integrals Z(qn) can be computed analytically: its 
entries Tjk(qn) are given by 


1 
i= J (1 — Øl) exp(i0é;4) dé = sinc? (Zé;x) 


with £e = = (Ajlan) — Ae (an). (5.11) 


Note that in the (computationally uninteresting) small-time-step limit h/e > 
0 the integrator (5.10) converges to the St6rmer-Verlet method. 

The easiest way to approximate the quantum vector 7(t) is to keep 
n(t) = 7(0) simply constant. According to the quantum adiabatic theorem 
[BF 28] the resulting error is only O(¢) as long as the eigenvalues of H(q(t)) 
are well separated and the eigendecomposition remains smooth. A more re- 
liable method, which in its variable-time-step version follows non-adiabatic 
transitions in 7 occurring near avoided crossings of eigenvalues, is obtained 
by integrating Eq. (5.7) from tn — h to tn + h, using the linear approxima- 
tion (5.9) for &(t), and freezing the slow coupling matrix W (q(t), q(t)) at the 
midpoint tn. This yields the adiabatic integrator from [Jah03}, 
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i E =2h(E@n) © F(an)* Wn) Nn (5.12) 


with J (qn) = >| E(@hA(qn)) dd. 


-1 


The (j, &)-entry of the matrix of oscillatory integrals J (qn) is simply 


1 1 
Tj (Qn) = 5 / _OxP (Es) d0 = sinc jx. 


The explicit midpoint rule is recovered in the limit h/e — 0. The derivative 
contained in (5.5) and the integral in (5.2) are not known explicitly but can 
be approximated by the corresponding symmetric difference quotient and the 
trapezoidal rule, respectively. These approximations are denoted by Wn and 
@,, in the above formulas. 

The approximation properties of method (5.10), (5.12) for large step sizes 
up to h < /é are analysed in [Jah03]. A discrete quantum-adiabatic theorem 
is established, which plays an important role in the error analysis. 


5.2 Adiabatic integrators for problems with time-dependent 
frequencies 


Adiabatic integrators for mechanical systems with a time-dependent multi- 
scale Hamiltonian (2.5) are presented in [HLW06, Chap. XIV] and [Lor06], 
following up on previous work by Lorenz, Jahnke & Lubich [LJL05] for sys- 
tems (2.5) with M(t) = I and A(t) a symmetric positive definite matrix. To 
simplify the presentation, we ignore in the following the slow potential and 
set U = 0. 

The approach is based on approximately separating the fast and slow time 
scales by a series of time-dependent canonical linear coordinate transforma- 
tions, which are done numerically by standard numerical linear algebra rou- 
tines. The procedure can be sketched as follows: 


e The Cholesky decomposition M(t) = C(t)~7 C(t) and the transformation 
q +> C(t)q change the Hamiltonian in such a way that the new mass matrix 
is the identity. 

e The eigendecomposition 


AW) = 20 (9 aye) QO, 20 = diagluy(0) 


of the symmetric stiffness matrix A(t) allows to split the positions q = 
(go, q1) and momenta p = (po, pı) into slow and fast variables qo, po and 
qı, pı, respectively. 

e The fast positions and momenta are rescaled by ¢~!/?Q(t)!/? and by 
Et/2 Q(t) 712, respectively. 
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e The previous transforms produce a non-separable term q’ K(t)p in the 
Hamiltonian. One block of the matrix K (t) is of order O(e~!/?) and has 
to be removed by one more canonical transformation. 


The Hamiltonian in the new coordinates p = (po,p1) and q = (qo, q1) then 
takes the form 


1 1 1 1 
H(p,q,t) = 5 Po Po + zg P1 Ae) p1 are qT Atha + qT L(t)p+ 3 q7 S(t)q 


with a lower block-triangular matrix L and a symmetric matrix S of the form 


L= Lo =O g= Soo Et? Soi 
~ NeW? Eig Li)’ ~ Ne? S19 eS j’ 
Under the condition of bounded energy, the fast variables qı and pı are now 
of order O(e!/?). The equations of motion read 


po = folp,q, t) 
go = Po + go(q, t) 


(a) == (ato an) (a) * Cota) 
with functions 


a = —L(t)p — S(t)q, e = L(t)"q, 


gı 


which are bounded uniformly in e. The oscillatory part now takes the form of 
a skew-symmetric matrix multiplied by 1/e, similar to (2.7). We diagonalize 
this matrix and define the diagonal phase matrix ® as before: 


(aP) rari) em 


Q(t) O0 ) I 
A(t) = ; P(t) = A(s) ds. 5.14 
Cen ane w= f a) (5.14) 
The transformation to adiabatic variables is now taken as 
= eV exp -t0 ) r (7! 5.15 
n= e2 exp(-t00)) r (™ (5.15) 


with the factor e—!/? introduced such that 7 = O(1). The equations of motion 
become 





Po = —Loopo — Soogo — €$01Q17 
Go = Po + Lodo + eLtyQin (5.16) 
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for the slow variables, and 


n= exp(-+9) W exp(Ż0) n — Př (Liopo + $1090) (5.17) 


for the adiabatic variables, where 


=, ps [Li —ES11 Pr\ _ i 
W=r ( 0 EE E @ =exp(+0). 


Slow and fast degrees of freedom are only weakly coupled, because in the 
slow equations (5.16) the fast variable 7 always appears with a factor £e. The 
oscillatory part has the familiar form of a coupling matrix framed by oscilla- 
tory exponentials, cf. (3.18) and (5.4). Under a separation condition for the 
frequencies w,(t), the fact that the diagonal of W is of size O(e) implies that 
the expressions J; = |n;|? are adiabatic invariants. I; is the action (energy 
divided by frequency) 


1 sl 
es (5 20 ji 2 ). 
i a ND Pi, j + 92 Lj 


An adiabatic integrator for (2.5) is obtained by the following splitting (for 
details see [HLW06, Chap. XIV] and Lorenz [Lor06]): 


1. Propagate the slow variables (po, qo) with a half-step of the symplectic 
Euler method. For the oscillatory function Q:(t), replace the evaluation 
at tn+1/2 = tn +h/2 by the average 


2 tn+1/2 
t 


n 


obtained with a linear approximation of the phase (t) and analytic com- 
putation of the integral. 

2. Propagate the adiabatic variable 7 with a full step of a method of type 
(3.19) for (5.17). 

3. Propagate the slow variables (po, qo) with a half-step of the adjoint sym- 
plectic Euler method, with an appropriate average of Q1 (t). 


The approximation properties of this method are analyzed in [HLW06, Chap. XIV], 
where it is shown that the error over bounded time intervals, in the original 
variables of (2.5), is of order O(h?) in the positions and O(h) in the momenta, 
uniformly in £ for h < y£. Numerical comparisons with other methods illus- 
trate remarkable benefits of this approach [Lor06, LJLO5]. 

We present numerical illustrations from Lorenz [Lor06] for the time- 
dependent Hamiltonian (2.5) with M(t) = J and 


t+3 8 V 
Alt) = ( 5 aa 
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Fig. 5.1. Frequencies w; (upper) and ||Q]| (lower) for 6 = 1, 0.1, 0.01 


on the time interval [—1, 1]. The behaviour of the components of the solution 
q(t) and the adiabatic variable 7(t) are as in Fig. 3.1 for 6 = 1 and 7 = 0.01. 

Figure 5.1 shows the frequencies and the norm of the time derivative of 
the matrix Q(t) that diagonalizes A(t) for various values of the parameter ô. 
For small values of 6, the frequencies approach each other to O(d) at t = 0, 
and ||Q(0)|| ~ 8-1. This behaviour affects the adiabatic variables 7; (t), as is 
shown in Fig. 5.2. 

For 6 ~ e!/?, there appears an O(1) change in 7 in an O(d) neighbourhood 
of t = 0, and for smaller values of 6 the components of 7 essentially exchange 
their values; cf. Zener [Zen32] for the analogous situation in Schrödinger- 
type equations (2.7). Small step sizes are needed near t = 0 to resolve this 
behaviour. Figure 5.3 shows the step sizes chosen by a symmetric adaptive 
step selection algorithm described in [HLW06, Chap. XIV] for different values 
of 6. Errors of similar size are obtained in each case. 


5.3 Integrators for motion under a strong constraining force 


The methods and techniques of the previous subsection can be extended to 
problems (2.6) with solution-dependent high frequencies [HLW06, Lor06]. The 
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Fig. 5.2. Adiabatic variables 7; as functions of time 
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Non-adiabatic transition: step sizes as function of t 








step size h 














time t 


Fig. 5.3. Step size vs. time for ô = 2°, 2~*, 274, 27° (increasingly thicker lines) 


procedure is again to apply a series of canonical coordinate transformations 
to transform (numerically) to a system with nearly-separated slow and fast 
components: 


: 1 = 
Bo = — Va ( 5 P3 Mo (ao) "po + U(qo,0)) 
-V (= pT (a0) + =F Q(a0)a1) + folo.4) 
AOT al j 
Go = Molqo)™ po + go(p, q) (5.18) 


a = ~( 0 a (2) y Gay 

ni) e\2q) 0 qı gı(p,q) 

with the diagonal matrix (qo) of frequencies w; (qo) and smooth functions of 
magnitude fy = O(e), go = Ole) and fı = O(e!/?), gı = O(e!/?) in the case 
of bounded energy and well-separated frequencies. The fast motion of (p1, qi) 
is followed numerically in the adiabatic variables 7, which are again defined 
by (3.17). In these coordinates, the integrator used is then similar to those 
of Sects. 5.1 and 5.2. Alternatively, the system is integrated in the original 
coordinates by multiple time-stepping methods such as the (mollified) impulse 
method of Sect. 3.2, and the above-mentioned transformations are only used 
as a theoretical tool for analysing the numerical method. We refer to [HLW06, 
Chap. XIV] for more details. 

The actions I; = |n;|* are again adiabatic invariants in the case of well- 
separated frequencies, remaining O(eV (m+1)) close to their initial values over 
bounded time intervals if additionally all the expressions wj + wz + w, have 
zeros of multiplicity at most m. It is worthwhile to note that the oscillatory 
energy appearing in (5.18) can be written as 














1 1 
zg P1 A(go)Pr + zz at 2Co) = X; Lj wla) 


so that the limit equation of the slow variables for € — 0 takes the form first 
discovered by Rubin & Ungar [RU57], 
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; 1 = 
Po = — Vo (5 po Mo(q0) po + U(qo,0) + 0, Tj 12; (40)) 


Go = Mo(qo)~ ‘po 


with the oscillatory energy acting as an extra potential. However, as was noted 
by Takens [Tak80], the slow motion can become indeterminate in the limit £ > 
0 when the frequencies do not remain separated; see also Bornemann [Bor98}. 
In contrast to the integration of the slow limit system with constant actions 
Ij, the numerical integration of the full oscillatory system by an adiabatic 
integrator with adaptive time steps detects changes in the actions. Moreover, it 
can follow an almost-solution (having small defect in the differential equation) 
that passes through a non-adiabatic transition. 
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Summary. Energy level crossings are the landmarks that separate classical from 
quantum mechanical modeling of molecular systems. They induce non-adiabatic 
transitions between the otherwise adiabatically decoupled electronic level spaces. 
This review covers results on the analysis of propagation through level crossings of 
codimension two, a mathematical justification of surface hopping algorithms, and a 
spectral study of a linear isotropic system. 


1 Introduction 


Molecular systems are a prime example of a multiscale problem. The light 
electrons move rapidly, in a highly oscillatory fashion, while the nuclei, as the 
heavier parts of the molecule, move much slower. T his separation of mass 
and subsequently time and space scales is at the heart of Born-Oppenheimer 
approximation. It allows for a drastic reduction of problem size when dealing 
with molecular systems. 

A quantum mechanical, non-relativistic description of a molecule is given 
by the molecular Schrödinger operator 


N n 
ee Sie ne ae P| 
mol 2M; ^ dj Qn; 
j=l j=l 
= Z;Zx\q; a Zp |x; =e 
=F |£; — £k] + jZkldj — kl — kl£j — el 
j<k j<k j,k 


where the vectors q and x denote the positions of N nuclei and n electrons, 
and Mj, Z; denote mass and charge of the jth nucleus. For the simplicity of 
notation, one assumes that the nuclei have identical mass M and introduces 


a scale parameter 
0<e=V1/M «1, 


which is of the order 10~?, typically. One rewrites the operator as 
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e2 
Hmol = -7^4 T Halq), 


where the electronic Hamiltonian He(q) acts, for a fixed nuclear configuration 
q, on the electronic degrees of freedom only. 

The first step of Born-Oppenheimer approximation consists in solving the 
electronic eigenvalue problem for all nuclear configurations, 


Vq : Hea(q)x(q, x) = E(q)x(q, x). 


For the second step, one assumes that the electronic energy levels of in- 
terest are uniformly separated from the remainder of the electronic spectrum. 
That is, if one is interested in two levels E~ (q) and E+ (q), 


vq: dist({E*(q), E~ (4)}, o(Hala)) \ {E7 (0), EB (@)}) > 6 


for some 6 > 0. Then, one looks for a diabatic basis {¥® (q, x) } of the electronic 
subspace span{x*(q,x)}, such that the mapping q +> X= (q, £) is smooth. If 
one replaces the Coulomb interactions inbetween nuclei, and between nuclei 
and electrons, by a mollified charge distribution, then the electronic Hamilto- 
nian depends smoothly on X, which guarantees existence of a diabatic basis. 
A diabatic basis {X= (q, x)} is expected to be different from the adiabatic basis 
{x= (q, x)} if the electron energy levels E*(q) have the same symmetry, see 
[LS06]. Given a diabatic basis, one builds a hermitian matrix 


= (V--(4) VA® 
vo = (voy vty): 


whose entries consists of the expectation values of the electronic Hamiltonian 
with respect to the diabatic basis functions, 


Vila) = (X (a, J, Hala) X (a, ‘)) 12, , k,l € A +}- 























The Born-Oppenheimer Hamiltonian is then given as 
e2 
It is a two-level Schrödinger operator acting only on the nucleonic degrees 


of freedom. Let X(q, x) = (Xt (q, x), X (q, x). If y(q,t) is a solution of the 
time-dependent Born-Oppenheimer problem 


ied = Hgov, plq, 0) = pola), (1.1) 


then Y(q, t) - X(q, £) is an approximate solution of the full molecular problem 
ic, U = Ano, W(q, T, 0) = pola) ` X(q, x) 


by an error of order € as € — 0, see [ST01]. 
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Fig. 1.1. The energy levels of the model for the cis-trans isomerization of retinal in 
rhodopsin of [HS00]. They cross, when the angular variable ¢ is approximately 5 or 
a and the collective coordinate vanishes. In the plot, the abscissa corresponds to 
$, the ordinate to y. The two local minima of the lower energy level are associated 
with the cis and the trans configuration of the molecule. (See page 694 for a colored 
version of the figure.) 


For polyatomic molecules, which consist of more than two nuclei, one has to 
expect the crossing of electron energy levels, that is the existence of nucleonic 
configurations q with Et(q) = E~ (q). These crossings, or more precisely, 
those with a crossing manifold of codimension two or higher, 


codim{g € R*; E*(q) = E~ (q)} > 2, 


induce non-adiabatic transitions that are of leading order in € as € — 0: Denote 
the orthogonal eigenprojectors of the matrix V (q) by I7*(q). There is a large 
set of initial data wo with 











T*y=0 and t>0: M+(e~#ot/ey) = O(1), € 0. 





That is, the solution of the two-level system performs a non-adiabatic tran- 
sition from the eigenspace associated with the eigenvalue E*(q) to the one 
associated with E*(q). 

Non-adiabatic transitions are typically linked with ultrafast isomerization 
processes, radiationless decay, or molecular collisions. The most spectacular 
example of a femtosecond isomerization modelled by an electron level crossing, 
the cis-trans isomerization of retinal in rhodopsin, is the first step of vision, 
see [HS00]. Rhodopsin is the light-absorbing pigment of the rods, which are 
responsible for the acute, but coarse colorless vision. The model incorporates 
two electronic levels E~(q) and E*(q) and considers two nucleonic degrees 
of freedom q = (¢,y), an angular variable ¢ € T and a collective coordinate 
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Zor d= and y = 0, see 


y € R. The levels E*(q) cross twice, when ¢ ~ $ 5 

Fig. 1.1. The two local minima of E~ (q) are associated with the cis and the 
trans configuration of the molecule; the lower steeper one represents the stable 
cis configuration and the higher flatter one the unstable trans configuration. 
Initially, the wave function is localized in the cis minimum. After photons 
excited the wave function vertically to the upper level E*(q), it runs down 
to approach the two crossing points and performs a non-adiabatic transition 
there, down to the trans minimum of the lower level E~ (q). This isomerization 
is considered the first step in a chain of events that culminate in a change of 
the impulse pattern sent along the optic nerve. 

Generally, physical models incorporate more than two nucleonic degrees 
of freedom. The numerical solution of Schrödinger equations with high di- 
mensional configuration spaces, however, is a challenging task. A grid based 
representation of the wave function scales exponentially in the number of 
space dimensions. Since the wave function is highly oscillatory, with oscilla- 
tions in space and time of about the order e€, a full grid based discretization in 
four space dimensions is still considered at the borderline of current computer 
power. 

On the other hand, the wave function itself does not have any direct phys- 
ical interpretation. Meaningful are quadratic quantities of the wave function 
like the position density or expectation values. A suitable vehicle for encod- 
ing quadratic quantities of a wave function 7 € L?(IR¢,C?) is the associated 
Wigner function 





Wap) = On)" f e'™™vq— seat Sudy, (Gp) ER 


which is a function on phase space R? taking values in the space of hermi- 
tian 2 x 2-matrices. We refer to Chapter 1.8 in [Fol89] for an exposition of 
the basic properties. Here, we will restrict ourselves to mentioning marginal 
distributions and the relation to expectation values. 

Integration of the Wigner function with respect to momentum and position 
space result in position and momentum density, respectively, 


| tr (WWA p) dp = WW), | tr (W ()(q,p)) dq = (2r€)~* Ro. 
Rd Rd 


Recall that Weyl quantization associates with a smooth, compactly supported 
function on phase space, a € C§° (R24, C?*?), a bounded operator a(q, —ieVq) 
on the Hilbert space L?(IR%,C?), whose action is defined by 


a(a.—ieV, (a) = f Palla + user) Hlu)dude, 


The expectation values of a Weyl quantized operator is encoded by the Wigner 
function, too: 
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f," OV Wa p)ala p) dadp = (9, ala, —ieVg)) r2 


The last identity, combined with the theorem of Calderón-Vaillancourt that 
asserts that the operator norm of a(q,—ieV,) is bounded by a finite sum of 
sup-norms of derivatives of a, allows one to view the Wigner function as a 
distribution, 


W(4): am i tr (W (Y)(q, p)a(q, p)) dadp. 


Let w be the solution of the two-level system (1.1). If one considers the 
Wigner matrix W (y) with respect to an eigenbasis of the eigenvalues E~(q), 
the diagonals of the matrix W (y) show a much more favorable behavior as 
compared to the highly oscillatory wave function vw. In the semiclassical limit 
€ — 0 the diagonals can approximately be described by classical transport 
and a non-adiabatic transfer of weight between them, see Theorem 2.2 below. 
Hence, level populations, that is 














IEOR = f TOWO p) (la)dadp 





or other quadratic quantities related to the projected wave functions =y (t), 
can be computed efficiently, even in high dimensional situations. 


2 Analysis of the dynamics 
The mathematical analysis of time-dependent two-level Schrödinger systems 
ied = (-FA +V) vv) = HH E RLC) (21) 


with crossing eigenvalues has been pioneered by Hagedorn [Hag94]. For time- 
reversible molecular systems the potential matrix is real-symmetric, 


V(q) aik te v(a) ) | (2.2) 


v2(q) —v1(q) 


where w,v1,V2 E€ C®(R?, R) are smooth, real-valued functions with decay 
properties guaranteeing the essential self-adjointness of the Hamilton operator 


2 
H=-5A,+ V(q). 
Denoting v(q) = (v1(q), v2(q))’, the matrix V(q) has the eigenvalues 


w(q) + yvi (q)? + v2(q)? = w(q) = |v()|- 
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The crossing manifold {q € R4; v(q) = 0} of coinciding eigenvalues has codi- 
mension two if one assumes 


v(q)=0 => rank Do(q) = 2, 


where Du(q) = (Vqv1(q), Vqv2(q)). These crossings are called conical inter- 
sections in the chemical physics literature, see [DY K04]. They seem to be the 
most commonly studied type of level crossings. 

The mathematical results on the dynamics near conical intersections rely 
on the smallness of the semiclassical parameter 0 < e < 1, giving dynam- 
ical descriptions that are asymptotic with respect to e — 0. They fall into 
three groups: the propagation of Gaussian wave packets [Hag94], of two- 
scale Wigner measures [FG02, FG03, Fer03, FL03], and of Wigner functions 
[LT05, FLO6]. Resolving the non-adiabatic transitions at the crossing mani- 
fold, all approaches rely on some kind of normal form, which in its essence is 
the Landau-Zener operator [Zen32] 


-ia (5 7), yER. 


ys 


In the framework of microlocal analysis, that is, locally in phase space, 
normal forms for generic level crossings have been derived by Fermanian Kam- 
merer and Gérard [FG02, FG03]. T he microlocal normal forms of Colin de 
Verdiére [CdV03, CdV04] even allow for superpolynomial error estimates. 


2.1 Heuristics 


An intuitive, but non-rigorous argument that shows the Landau-Zener oper- 
ator to essentially gear the dynamics through conical intersections has been 
given for the linear isotropic potential 


V(iq) = & 1 ) (2.3) 


q2 —qı 
by Teufel and the second author [LT05]. We will adapt these heuristics to 
general conical intersections in the following. We note, that they result in an 
explicit formula for the gap y, which will be a crucial ingredient of the effective 
asymptotics and the subsequent numerical algorithm that we aim for. 


The Schrödinger operator H is the semiclassical Weyl quantization of the 
matrix-valued symbol 


R” > R??, (q,p) + 4 |p|? + V(q). 


Given its eigenvalues 





R >R, (a p)=> §lp|? + wla) + |o(9)| 
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one associates the two Hamiltonian systems 


(25, pV Si (2.4) 





and the corresponding classical flows $., which are well-defined away from 
the crossing manifold. 

As a first step, one formally inserts a classical trajectory (q(t), p(t)) of one 
of the Hamiltonian systems into the trace-free part of the symbol of the full 
operator, obtaining the purely time-dependent problem 


tn — [rala®) w(@®) 
oe) oy ay oy): (2.5) 


Such systems show non-adiabatic transitions in the region, where the gap be- 
tween the eigenvalues is minimal (see e. g. [Bor98, HJ04, BT05]). A necessary 
condition for the gap between the eigenvalues to become minimal along the 
chosen trajectory is 





4 lv(a(t))|? = 0. 
This condition is satisfied if the trajectory passes the hypersurface 
S = {(q,p) € R**; Do(q)p- v(q) = 0}. 


Let a € [—1, 1] be an angle to be determined later. A conjugation by the 
half-angle rotation matrix 
a 3 -= a) 
sin p COS p 


transforms problem (2.5) to 


in _ f (cosa,sina)* -v(q(t)) (cosa,sina)* A v(q(t)) 
ca (= a, sina)’ A v(q(t)) —(cos a, sin a) can o(0), 


where z A y = 21 y2 — £241 denotes the symplectic product of the two vectors 
x,y € R?. Assuming that the chosen trajectory passes the hypersurface of 
minimal gap through the point (qx, px) at time t = 0, one linearizes, 


v(q(t)) = v(qx) a tDv(qx) Px + O(t?). 


Aiming at a Landau-Zener problem with the diagonals linearly depending on 
time and the off-diagonals constant, one chooses the rotation angle as 


a= arccos( Taata) 2 ) = aresin( arela) 2) ; 


|Dvu(d«)px| |Du(q«) P| 


Since Dv(q.)ps + v(qx) = 0, the linearized system then reads as 
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Dv (qu )pxAv(qx 
Dia —tDv(d«)p«| 


If |Dv(qx)p«| >> 0, one sets a new semiclassical parameter č = €/|Dv(q.)px|, 
and obtains the Landau-Zener problem 


w(t) = (5 7.) of 


with gap 
x Du(qx )px A v(qx) 
|Du(d«)ps|? 


Let ¢*(t) denote the components of the vector ¢(t) with respect to the 
eigenbasis of the Landau-Zener matrix, and put ¢*(+00) = limy +o o*(t). 


(isle G@) oo (FC) =). 


(Ge) = Car) (pce) 


with a transition rate 




















then 





T ~ exp(-2?) ; ée— 0. 
č 


In particular, the Landau-Zener rate T exhibits that the non-adiabatic tran- 
sitions are of leading order in € if the chosen trajectory experiences a gap y 
that is of order vē. 


Remark 2.1. We note that even though the time-dependent problem (2.5) we 
started with is formulated just in terms of the position coordinate q(t), the 
resulting Landau-Zener problem has a gap y that depends on phase space 
information, namely the point (q.,p«) at which the trajectory attains the 
minimal gap between the eigenvalues w(q) + |v(q)|- 





2.2 Branching process 


The heuristics motivates the following definition of random trajectories and a 
corresponding Markov process for effectively describing the dynamics through 
conical intersections. 

One attaches to points (q,p) € R4 in phase space a label —1 or +1, 
indicating reference to the eigenvalue w(q) — |v(q)| or w(q) + |v(q)|. Moreover, 
one chooses a positive number R > 0, defining the set 


{(q,p) € R74; |v(q)| < Rve} 
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as a distinguished tubular neighborhood of the crossing manifold. For labelled 
phase space points (q, p, j) € R?4 x {—1, +1}, one sets 

T4P:3) : (0, +00) > R°? x {-1, +1} 
such that 742:3) (t) = ($ (q, p), j) as long as 


WWE) > Rve or Dold E) (t) -vl (t)) 4 0. 
A jump from j to —j occurs with probability 


T (Du(qu)D« ^ v(qx))? 
aS ee € |Dv(dx)p«/ 


whenever ®5(q,p) hits the manifold of minimal gap 


S = {(q,p) € R**; Dv(g)p- v(q) = 0}. 


at time t in a point (qx, px) E€ S with |v(qx)| < Re. 

The randomized evolution T‘%?-J)(t) defines a Markov process. The asso- 
ciated backwards semi-group £t is given by its action on a class of continuous 
scalar-valued functions a = a(q,p,j) satisfying suitable boundary conditions 
at the manifold of minimal gap S, 


Lia(q,p, j) = BOP a (TOP), 





(2.6) 


see [LT05, FL06]. This definition naturally extends to matrix-valued functions 
of the form a = at H+ +a7 I~ with a* € C5°(R?4\S,C), that is, to functions 
that commute with the potential matrix V. By duality, the semigroup acts on 
Wigner functions also, 








L'W(Y): a i (W (w)(q,p)(La)(q,p)) dadp. 


Theorem 2.2 ([LT05, FL06]). Let (Y§)e>o be a bounded sequence in L? (Rt, C?) 
such that IT = 0 and there exists ô > 0 with 


lim [ IW (S)(a,p)| dadp = 0, 


e—0 


where S5 = {(q,p) € R4; |v(q@)|, |Dv(q)p - v(q)| < 5}. Let V be a matriz with 
conically intersecting eigenvalues as given in (2.2). Suppose, that q +> |v(q)| 
is convex, and that Dv(q)Vw(q) - v(q) < 0 for all q € R. 


Then, for all T > 0, the solution W<(t) of the Schrödinger equation (2.1) with 
initial data w*(0) = W§ satisfies 


sup (W (Y(t) — L'W(5)) alq, p) dqdp 
tE[0,T] J R24 


= O(R™) + O(R? Ve) + O(ve|lnel]) 
for alla = at H+ +a H with a* € CS°(R*\ S,C). 
0 
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Remark 2.3. Note, that the error is minimal as R = e~!/8 and is of order e!/8. 


Remark 2.4. The assumptions on the potential V(q) guarantee that minus- 
trajectories issued from the upper level never meet the crossing again. Hence, 
the system dealt with does not show any interlevel interferences, which could 
not be resolved by merley working on the diagonal of the Wigner matrix. 


Remark 2.5. The assumptions on the initial data (Y§)e>o ensure that the wave 
function does not localize near the manifold of minimal gap and the crossing 
manifold initially. This is due to the semigroup encorporating only an effective 
treatment of the non-adiabatic transitions, which becomes valid when the 
solution has passed by the crossing and the manifold of minimal gap. 


In [LT05], Theorem 2.2 has been proven for the linear isotropic potential 
(2.3) with an error of o(1) as e — 0. The proof for general potentials pro- 
viding the error bound with respect to €e and R is given in [FL06]. It falls 
into two parts: Away from the crossing, there is only classical transport. One 
shows by semiclassical Weyl calculus that the error of propagation is of order 
O(R~*) + O(./fe). Near the crossing, non-adiabatic transitions become rele- 
vant. For proving the correctness of the asymptotic transition rates one resorts 
to a refined version of the microlocal normal form of [CdV03]. 


2.3 Surface hopping algorithm 


The semigroup £t suggests a numerical algorithm that can be seen as a rigor- 
ous counterpart to the surface hopping algorithms of chemical physics. Such 
algorithms have been introduced by Tully and Preston in [TP71] for studies 
of molecular collisions. To our knowledge Theorem 2.2 is the first mathemat- 
ically rigorous justification for such an approach. For high dimensional prob- 
lems in photochemistry surface hopping seems to be one of the most popular 
algorithms employed. We refer to the review [ST05] as a pointer to the vast 
chemical literature on algorithms of this type. 

A numerical realization of the semigroup L‘ is achieved by the following 
steps: one projects the initial wave function to the energy levels and com- 
putes the associated Wigner functions. After sampling the Wigner functions, 
one propagates along the classical trajectories of the Hamiltonian systems 
(2.4) and opens up a new trajectory on the other level whenever a trajectory 
passes through the hypersurface of minimal gap. When splitting up, the weight 
associated with the trajectories is distributed according to the Landau-Zener 
transition coefficient (2.6). 

[LST06] thoroughly validates this algorithm for systems with linear isotropic 
potentials. Figure 2.1 shows the relative error of level populations for the fol- 
lowing test case: the initial data are Gaussian wave packets associated with 
the upper level localized at a distance of 5,/e from the crossing with aver- 
age momentum of order one. The time evolution stops at time t = 10,/e. 


Energy Level Crossings in Molecular Dynamics 587 





Absolute error 











j 1 
-4 -3 2 107 


10° 
Semiclassical parameter 





Fig. 2.1. The absolute error of the surface hopping algorithm versus a Strang 
splitting scheme for level populations with respect to the semiclassical parameter e. 
The dashed line is the function ie, while the solid line is the absolute error for 
the final populations on the two levels. 





The level populations || 7*~(t)||7.2 are computed by the rigorous surface hop- 
ping algorithm as well as by a numerically converged Strang splitting scheme. 
As a function of the semiclassical parameter, the resulting absolute error is 
bounded by se. All the other experiments of [LST06] show a comparable 
error of order y£, indicating, that the e!/8 error bound of Theorem 2.2 is not 
sharp. 


3 Spectral study 


Mathematically rigorous spectral studies of operators with crossing eigenval- 
ues have aimed at resolvent estimates [Jec0303, Jec05] and bounds on the num- 
ber of resonances [Néd96, Néd01, Néd03]. More explicit, quantitative investi- 
gations showing a clear spectral fingerprint of non-adiabatic origin have been 
undertaken by Avron and Gordon in the zero energy regime [AG00a, AGOOb]. 
Complementarily to these results, the joint work of the second author with Fu- 
jiié and Nédélec [FLN06] deals with Bohr-Sommerfeld conditions for energies 
bounded away from zero. 

The common model operator of [AGO0a, AGOOb, FLNO6] has a linear 
isotropic potential matrix with conically intersecting eigenvalues, 


H=-e2A V = -2A qı q2 , 
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Its scalar counterparts are the one-level operators 











H* = -e 4A; +|q\. 


The upper level operator H* has a confining potential, which is bounded 
from below by zero and increases to infinity as |q| oo. Hence, H* has purely 
discrete spectrum with strictly positive eigenvalues (see Theorems XIII.47 and 
XIII.67 in [RS78]). The lower level operator H~ has a repulsive potential. Its 
commutator with the generator of dilations D = x(q -Va + Vq: q) is positive, 


[H7 , iD] = —2A, + |q], 


and yields a global Mourre estimate. Hence, H~ has purely absolutely contin- 
uous spectrum (see Corollary 4.10 in [CF*87]). The full operator H, however, 
inherits the purely absolutely continuous spectrum of H~, while echoing the 
discrete spectrum of H* with resonances close to the real axis. 

The resonances of the operator H are defined by complex dilation (see 
Theorem 2.1 in [Néd96]). They are the eigenvalues Æ € C of the complex 
scaled Hamiltonian 

Ho = ee A, + eV (q), 


which is a non-selfadjoint operator with discrete spectrum in the lower half- 
plane independent of the dilation parameter 0 €]0, |. 


Remark 3.1. For a large class of scalar Schrödinger operators, resonances de- 
fined by complex dilation have been identified with the poles of a suitable con- 
tinuation of the resolvent or of the scattering matrix. The underlying physical 
picture is that of a slowly decaying state, whose life-time is set by the imag- 
inary part of the resonance. We refer to Chapter 8 in [CF*87], Chapter 16 
in [HS96], or the review [Zwo99] as introductory reading for the theory of 
resonances. 


A resonance E of H is determined by solving Hy = Ey in the distri- 
butional sense and validating decay and regularity properties of the dilated 
resonant state q > u(e~’%q). One uses the radial symmetry in V(q) to reduce 
the partial differential operator H to a direct sum of ordinary differential 
systems: the operator P is unitarily equivalent to 


P tino, Mnao = (F Ga) 


2 
A ev/r r+ 
vEZ+3 


where (r,p) € Rt x R. Nédélec [Néd96] has derived this equivalence by a 
Fourier transformation, a change to polar coordinates (r,¢) € R* x T, a 
conjugation by the half-angle rotation matrix 


0) n 
o 2 = one) . 
sin p COS 7 
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and a final Fourier series ansatz in the angular variable ¢. A similar decom- 
position labelled by half-integers has been obtained by Avron and Gordon 
[AG00a, AGOOD], using the commutation relation of H with an angular mo- 
mentum operator L, 


; 0i 
|H, L] = 0, L=an(-iev,) +4 (2, ae 

Both decompositions share the fact, that the half-integer labelling turns 
the conical intersection q > +|q| into a family of avoided crossings. However, 
while in [AG00a, AGOOb] the ordinary differential systems are solved in the 
zero energy regime in terms of generalized hypergeometric functions, the aim 
of [FLNO6] is an explicit asymptotic analysis (e — 0) of non-zero energies by 
means of an exact WKB method. 

According to the decomposition (3.1) one associates with a resonance Æ 
of H an angular momentum number v € Z + 4 if E corresponds to a distri- 


2 
butional solution of the ordinary differential problem 





H,(r, —i¢0,; €)u = Eu, 


such that r +> u(e~“’r) satisfies appropriate boundary conditions as r — 0 
and r — +00. Consider E €]a,b[ for positive numbers 0 < a < b. If € > 0 is 
sufficiently small, the energy surface 


{(r,p) € Rt x R; det (H,(r, p; €) — E) = 0} 


consists of two connected curves, a closed simple one and one being un- 
bounded. Let A, (E, €) be the action associated with the closed curve, 


A, (Eye) = 2 | y det (H,(r, 0; €) — E) dr, 


where 0 < ro < rı are the first and second positive zero of the mapping 
r+ det(H_(r,0;¢) — E), and the square root is taken positive. As a function 
of E, the action A, (E, €) is extended analytically into a complex neighborhood 
of the interval Ja, b[. The first result is the following Bohr-Sommerfeld type 
quantization condition of resonances with fixed angular momentum. 


Theorem 3.2 ([FLN06]). Let Ey > 0 andv € Z+ 4 be given. Then there 
exist ô > 0, €o > 0, and a function c(E,¢) : {(E,e) € Cx]0, eol; |E — Eo| < 
ô} > C with c(E, €) —> 0 uniformly in E as e > 0, such that E is a resonance 
of H = — A +V (q) with angular momentum v if and only if (E,€) satisfies 
the following quantization condition: 


= ye it/4 p73/4 eiA (E,6)/€ +1 = (c(E,e). (3.2) 
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To our knowledge, Theorem 3.2 is the first Bohr-Sommerfeld quantization 
condition for a Schrodinger system with crossing eigenvalues. The prefactor 
before the exponential of the action carrying the second scale y€ is a clear 
signature of non-adiabaticity, stemming from a connection formula involving 
the Landau-Zener problem 


. [ry og ep 8/4 2 
ie0,Uu = EC 7) u, Y= eE + O(e*). (3.3) 

For the proof of Theorem 3.2, the exact WKB method of Gérard and Grigis 
[GG88] is extended from scalar Schrödinger equations to a class of 2 x 2 first 
order differential systems, covering the case H,(r, —i€0,; €)u = Eu. The exact 
WKB solutions are of the form 


u(r) = etiz) ew(r), 2(r) = ih: y det(H,(s,0;¢€) — E) ds. 





They are locally defined away from turning points, which are the zeros of 
the mapping r +> det(H,(r,0;¢) — E). For E €lļa, b|, there are three positive 
turning points ro < rı < r2, where ro tends to zero, while rı and r2 coalesce 
at VE ase — 0. 

For obtaining global solutions, one connects the exact WKB solutions at 
the turning points, using the good e-asymptotics of the amplitude vector w. 
More precisely, one constructs an exact solution, which vanishes at the origin, 
and represents it after several connection procedures as a linear combination 
of solutions with controlled behavior at infinity. For r — +oo, there is a fun- 
damental system of solutions už, such that r+ u% (e~r) is exponentially 
growing and r +> uz(e~r) exponentially decaying. The quantization for- 
mula (3.2) corresponds to the condition that the connection coefficient of the 
exponentially growing solution ud, vanishes. 

The origin is a regular singular point of the equation H,(r, —ic0,; 6)u = Eu 
with indices +v. Moreover, the first turning point ro tends to zero as € — 0. 
One constructs an exact WKB solution in a small complex neighborhood of 
the origin, which corresponds to the index +v. Studying the two parameter 
asymptotics of this solution as (r,¢) — (0,0), one encounters the same diffi- 
culties as in the context of the Langer modification for the radial Schrodinger 
equation, see also [FR00]. The o(1) error estimate in Theorem 3.2 originates 
just from here. The rest of the proof gives better control on the convergence 
rate. 

Atr= VE, the second and third turning points rı and r2 coalesce as 
e — 0. The connection formula at this double turning point is calculated using 
a microlocal reduction to a normal form. Microlocally near (r, p) = (WE,0), 
the equation H,,(r, —i¢0,; €)u = Eu looks like the Landau-Zener problem (3.3). 
A further reduction step leads to the saddle-point problem 








2 
r(—ted,)u = a, 
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Fig. 3.1. Resonances of the model operator H = —e° A4 + V (q). The parameter 
k lies in {11, 12,...,60}, while v is chosen in {1.5,2.5,...,5.5}. The semiclassical 
parameter € varies from 107° to 1. (See page 694 for a colored version of the figure.) 


For this problem exact microlocal connection formulas are known [CP94, 

Ram96] and one has to lift them to the exact WKB solutions. The second 

scale y€ in the quantization condition (3.2) originates from this connection. 
An asymptotic study (Proposition 8.2 in [FLN06]) of the action yields 


A, (E,€)= 4p? + mve + O(e|lne]) (e= 0). 


Plugging this expansion into the Bohr-Sommerfeld condition (3.2), one is mo- 
tivated to define the following family of almost horizontal sequences: 





T,(a, b; €) = fa EC, A= Awe — iz (cnt—eln ae), 


k € Z s.t. Apve Ela, of 


with 
Aw = Z (8k —4v +5)  (KEZ,vEeZ+3$). 


Since Akne Ela, b[, the second term of the imaginary part of A € T, (a,b; €) 
is of O(e) and smaller than the first term —eln L, Hence, T, (a,b; e€) is an 
almost horizontal sequence in the lower half-plane with distance of O(eln +) 
from the real axis. The asymptotic distribution of resonances with real part 
in the positive interval ]a, b| reads as follows: 


Theorem 3.3 ([FLN06]). Let N E N and 0 <a < b be given. Then there is 
co > 0 and a positive function c:|0,€9[— Rt with c(e) = o(€) as e > 0 such 
that for each A € U,en Tu (a, b;€) there exists one and only one resonance of 
H = — A; + V (q) within the set {E € C; |E — VA) < c(e)}. 
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The plots in Fig. 3.1 illustrate the distorted lattice of resonances given by 
Theorem 3.3. The larger the angular momentum number v is taken, the closer 
the resonance is to the real axis and the longer the life time of the correspond- 
ing resonant states will be. This observation is in wonderful agreement with 
the dynamical properties of the operator P. The one-level operators H® in- 
duce Hamiltonian systems conserving angular momentum q ^p = q1p2— 42P1, 
which also encodes how close classical trajectories arrive near the crossing 
manifold {q = 0}. On the one hand, a high angular momentum number v 
of a resonance mirrors a periodic orbit of the upper level with high angu- 
lar momentum. Such orbits in turn imply existence of localized quasimodes 
and long-living resonant states. On the other hand, small angular momentum 
numbers v correspond to orbits close to the crossing manifold. Close to the 
crossing, non-adiabatic transitions to the unbounded motion of the minus- 
system are possible. In this regime shorter life-times and resonances far away 
from the real axis have to be expected. 
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Summary. We study the motion of electrons in a periodic background potential 
(usually resulting from a crystalline solid). For small velocities one would use either 
the non-magnetic or the magnetic Bloch hamiltonian, while in the relativistic regime 
one would use the Dirac equation with a periodic potential. The dynamics, with the 
background potential included, is perturbed either through slowly varying external 
electromagnetic potentials or through a slow deformation of the crystal. In either 
case we discuss how the Hilbert space of states decouples into almost invariant 
subspaces and explain the effective dynamics within such a subspace. 


1 Introduction 


In a crystalline solid the conduction electrons move in the potential created 
by the ions and the core electrons. Somewhat mysteriously and linked to the 
Pauli exclusion principle, the Coulomb repulsion between conduction electrons 
may be ignored, within a good approximation. Thereby one arrives at a fun- 
damental model of solid state physics, namely an ideal Fermi gas of electrons 
subject to a periodic crystal potential. Let I" be the periodicity lattice. It is 
a Bravais lattice and generated through the basis {71, Y2, 73}, y; € R3, as 


3 


j=l 


The crystal potential Vp is then T-periodic, i.e., Vp : R3 — R and Vp(a + 
y) = Vr() for all y € I’, and the electrons are governed by the one-particle 
hamiltonian i 

Hsg = -53r + Vr. (1.2) 


Hgp is the (Schrédinger)-Bloch hamiltonian. A wave function y+ € L?(R?) 
evolves in time according to the Schrodinger equation 
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io = Aspyr. (1.3) 


We have chosen units such that the mass of an electron me = 1 and h = 1. 
The electron charge, e, is absorbed in Vr. Since Vpr is periodic, electrons move 
ballistically with an effective dispersion relation given by the Bloch energy 
bands En, see below for a precise definition. En is periodic with respect to 
the lattice [* dual to T, En(k + *) = En(k) for all y* € I'*, k € R°. This 
feature makes the dynamical properties of a Bloch electron very different from 
a massive particle with dispersion Efree(k) = $k? valid in case Vr = 0. 

The thermodynamics of the electron gas is studied taking Hgp as a starting 
point. Dynamically, however, one wants to probe the response of the electrons 
to external forces which very crudely come in two varieties. 


(i) External electromagnetic potentials. Electrostatic potentials manufactured 
in a lab have a slow variation on the scale of the lattice I’. Therefore we 
set Vext(z) = ed(ex), e the charge of the electron, with € a dimensionless 
parameter and ¢ independent of £. € < 1 means that the potential V.x4 has a 
slow variation when measured with respect to the lattice spacing of I’. Note 
that the electrostatic force is O(¢) and thus weak. External magnetic fields 
on the other hand can be so strong that the radius of gyration is comparable 
to the lattice spacing. It then makes sense to split the vector potential as 
Ao + Aext, where Ag(x) = -4 Bo Aa with Bo € R? a constant magnetic field. 
Included in Hgp, this yields the magnetic Bloch hamiltonian 





oes 
Hyg = 5(-iVe — Ao)? + Vr. (1.4) 


Aext is a probing vector potential in addition to Ap. Aext is slowly varying 
on the scale of the lattice, Aext(a) = A(ex) with A independent of £, and the 
corresponding magnetic field is small of order £. Including all electromagnetic 
potentials, for simplicity with the electric charge absorbed into A and ¢, the 
hamiltonian becomes 

1 


H = 5(—#Vs — Ao(e) A(ex))” + Vr(x) + gex). (1.5) 





(ii) Mechanical forces. The crystal lattice can be deformed through exter- 
nal pressure and shear. Thereby an electric polarization is induced, an ef- 
fect which is known as piezoelectricity. If charges are allowed to flow, in this 
way mechanical pressure can be transformed into electric currents. The me- 
chanical forces are time-dependent but slow on the typical time-scale of the 
electrons. Therefore in (1.2) Vp(x) is replaced by Vren (x, et). T (et) is the 
instantaneous periodicity lattice and is defined as in (1.1). Vr@) is space- 
periodic, i.e. Vr (x + y,t) = V(a,t) for all y € I(t). The special case of a 
time-independent lattice, T(t) = I’, but a still slowly in time varying crystal 
potential is also of interest. For example, one may imagine a unit cell with 
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two nuclei. If the two nuclei are displaced relative to each other, then I’ re- 
mains fixed while the crystal potential in the unit cell changes with time. The 
resulting piezoelectric hamiltonian reads 


1 
Hpg(t) = 5c + Vr (et) (x, et) : (1.6) 


Our general goal is to understand, in each case, the structure of the solu- 
tion of the time-dependent Schrödinger equation for small e. Obviously, H in 
(1.5) is a space-adiabatic problem, while (1.6) corresponds to a time-adiabatic 
problem. However in the latter case it turns out to be profitable to transform 
to a time-independent lattice, say ['(0). Then also terms varying slowly in 
space are generated. Thus, in the general case the full power of the space- 
adiabatic perturbation theory [PST03a, Teu03] will be needed. A word of 
caution must be issued here for the magnetic Bloch hamiltonian. To use the 
methods from [PST03a] in this context, the unperturbed Hamiltonian must 
be periodic, which is the case only if the magnetic flux per unit cell is rational. 
One can then define an enlarged magnetic unit cell such that Hms is invariant 
with respect to magnetic translations. If the magnetic flux is not rational, the 
crutch is to include in Ao a nearby rational flux part of the magnetic field, 
with a small denominator, and to treat the remainder as Aext- 

To achieve our goal, depending on the context we use one of the peri- 
odic hamiltonians as backbone. The periodic hamiltonian is denoted by Hper 
with per either Asp, or Amp, or Hpg at fixed t, or His from (1.8), or Hpg 
from (1.9). As explained below, the Hilbert space H = L? (R?) then splits 
as H = Bro Hn, where n is the band index. Each subspace Hn is invari- 
ant under exp[—itHper] and Hper restricted to Hn is unitarily equivalent to 
multiplication by E,(k). En(k) is the effective hamiltonian associated to the 
n-th band. The complexity of the full problem has been reduced substan- 
tially, since only a single band dynamics has still to be studied. Modifying 
Hyper such that it becomes slowly varying in space-time is, vaguely speaking, 
a small perturbation. Thus one would expect that the invariant subspace Hn 
is to be substituted by a slightly tilted subspace. On this subspace E,,(k) 
will turn into a more complicated effective hamiltonian. The difficulty is that, 
while the dynamics generated by Hper can be computed by solving a purely 
spectral problem, none of the perturbed hamiltonians can be understood in 
this way. In particular, one has to spell out carefully over which time scale 
the slightly tilted subspace associated to Hn remains approximately invari- 
ant and in what sense the dynamics generated by the effective hamiltonian 
approximates the true time evolution. 

To lowest order the effective hamiltonian can be guessed from elementary 
considerations and belongs to a standard tool of solid state physics [AM76]. 
The guess provides however little hint on the validity of the approximation. 
There one needs a mathematical theorem which states precise conditions on 
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the initial wave function and provides an error bound, from which the time 
scale for validity can be read off. 

Under the header “geometric phase” physicists and quantum chemists have 
realized over the past twenty years, say, that the first order correction to the 
effective hamiltonian carries a lot of interesting physics, see [BM*03] for a 
recent comprehensive overview. For the magnetic Bloch hamiltonian the first 
order correction yields a Hall current proportional to the Chern number of the 
magnetic Bloch vector bundle. Similarly, the modern theory of piezoelectricity, 
expresses the piezocurrent as an integral of the Berry connection over the 
Brillouin zone, see King-Smith, Vanderbilt [KSV93] and Resta [Res94]. First 
order effective Hamiltonians are no longer guessed so easily and it is convenient 
to have the systematic scheme [PST03a] available. 

In nature electrons are spin 4 particles. The wave function is thus C?- 
valued and the hamiltonian in (1.5) is modified to 


H = 5(—iVe— Aol) — A(e2))? +Vr(2) + 4er) - Lo (Bo+eB(ex)) (1.7) 


with B = V AA for the slowly varying part of the magnetic field. Here 
o = (01,02,03) is the 3-vector of Pauli spin matrices. Besides the term pro- 
portional to the uniform magnetic field By, H acquires a subleading term of 
order €. More accurately one may want to include the spin-orbit coupling. 
The periodic piece of the hamiltonian reads then 


Hrs = a +Vp(x) + Lo - (WVr(x) A (-iVe)) (1.8) 


and the slowly varying potential is added as in (1.7) with the additional sub- 
leading term eyo - (Vd(ex) A (—iV+)). 

Depending on the crystalline solid, the conduction electrons can move so 
fast that relativistic corrections become important. On the one-particle level 
an obvious choice is then the Dirac equation with a periodic potential Vr. 
Wave functions are C4-valued and the hamiltonian reads 


Hpg = bme? +ca:p+Vr, p= —iVe. (1.9) 


We introduced here the mass, me, of the electron and the speed of light, c. 
The 4 x 4 matrices 3,01, v2, a3 are standard and defined in [Tha94, Ynd96], 
for example. Note that the Lorentz frame is fixed by the solid, i.e. by Vr. 

In fact, the non-relativistic limit for Hpp yields the spin-orbit hamiltonian 
Hyg [Tha94, Ynd96]. If ||Vr|| is bounded, for sufficiently large c, the Dirac 
hamiltonian Hpg has a spectral gap, which widens as c — oo. Projecting 
onto the electron subspace, to leading order in 1/c one obtains the Pauli- 
Bloch hamiltonian —(1/2m.)Az + Vr with the spin-orbit coupling in (1.8) as 
a correction of strength 1/(mec)?. In addition the crystal potential is corrected 
by —A,Vr(x)/8(mec)?. 

In our contribution we will provide some background on how to establish, 
including error bounds, the validity of the approximate dynamics as generated 
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by an effective hamiltonian, including order € corrections, for most of the 
models mentioned in the introduction. For this purpose it is necessary to 
briefly recall the spectral theory for the periodic hamiltonian, which is done 
in the following section. In the subsequent sections we deal with particular 
cases in more detail. We start with the non-magnetic Bloch hamiltonian, see 
(1.5) with Bo = 0. For the magnetic Bloch hamiltonian we explain how Bo —> 0 
and Bo — co may be viewed as particular adiabatic limits. Piezoelectricity is 
discussed in the last section. 


Remark. Our contribution is one part of the research project jointly with 
S. Bauer and M. Kunze within the Schwerpunkt. Their part will be covered in 
[BK06]. Both contributions appear now as almost disjoint, which only reflects 
that we wanted to present a coherent story. The unifying aspect is an adiabatic 
limit for wave-type evolution equations. In this contribution we stay on the 
level of effective hamiltonians while in [BK06] one pushes the scheme to the 
first dissipative correction. 


2 The periodic hamiltonians 


We consider a general dimension, d, and assume that the periodicity lattice 
I is represented as 


r= {ee R?:2= Ysa; for some a € Z4}, (2.1) 


where {71,..., Ya} are vectors spanning R?. We denote by I’* the dual lattice 
of I’ with respect to the standard inner product in R4, i.e. the lattice generated 
by the dual basis {yj,...,77} determined through the conditions y; -yj = 
2rõij, i,j € {1,...,d}. The centered fundamental domain M of I is defined 
by 

M = {we Rt: 2 = 54815 for B; € l-44}, (2.2) 


and analogously the centered fundamental domain M* of T*. The set M* is 
the first Brillouin zone in the physics parlance. 


Assumption 1. The crystal potential Vr : R? — R satisfies Vp(x + y) = 
Vr(x) for ally € I, x € Rt. Vr is infinitesimally bounded with respect to 
—A. 


It follows from Assumption 1 that the periodic hamiltonians discussed 
below are self-adjoint on the domain of — A. 


2.1 The Bloch hamiltonian 


We consider 
H =—-A+Vr. (2.3) 
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The periodicity of H is exploited through the Bloch-Floquet-Zak transform, 
or just the Zak transform for sake of brevity [Zak68]. The advantage of such 
a variant is that the fiber at k of the transformed Hamiltonian operator has 
a domain which does not depend on k. 

The Zak transform is defined as 


(Uzw)(k, x) := soo T+Nab(a +), (k, x) € R24, (2.4) 
vel 


initially for a fast-decreasing function w € S(R“). One directly reads off from 
(2.4) the following periodicity properties 


(Uzb)(k,y +7) = (Uzv)(k,y) forall yer, (2.5) 


(Usb) (k +A, y) =e (Uzy)(k,y) foral AET™. (2.6) 


From (2.5) it follows that, for any fixed k € R?, (Uzy)(k,-) is a T-periodic 
function and can thus be regarded as an element of He = L?(M). M = 
R/T and it has the topology of the d-dimensional torus T¢. On the other 
side, Equation (2.6) involves a unitary representation of the group of lattice 
translations on I’* (isomorphic to ’* and denoted as A), given by 


T:A>U(He), AMT A), (TA) u) =e ely). (2.7) 
It is then convenient to introduce the Hilbert space 
Hr = {p € LERI, He): v(k-A)=7(A) Y(k) for all A € A} 
= 12(R", Hy), (2.8) 


equipped with the inner product 
= fab), ena. (2.9) 


Obviously, there is a natural isomorphism between H, and L?(M*,H¢) given 
by restriction from R? to M*, and with inverse given by t-equivariant con- 
tinuation, as suggested by (2.6). Equipped with these definitions, one checks 
that the map in (2.4) extends to a unitary operator 


Uz : L?(R*) > H, S L?(M*, L?(M)), (2.10) 


with inverse given by 
(Uz *e)(x) = a dk e'** p(k, [2]), (2.11) 


where [-] refers to the a.e. unique decomposition x = Ys + [x], with ys € I 
and [z] € M. 
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As already mentioned, the advantage of this construction is that the trans- 
formed hamiltonian is a fibered operator over M*. Indeed, for the Zak trans- 
form of the hamiltonian operator (2.3) one finds 


(S> 
Ghia = dk H(k) (2.12) 
M* 
with fiber operator 
1 
H(k) = 5(—iVy +k)” +Vr(), ke M*. (2.13) 


By Assumption 1, for fixed k € M*, the operator H(k) acts on L?(M) 
with the Sobolev space H?(M) as domain independently of k € M*. Each 
fiber operator H(k) has pure point spectrum accumulating at infinity. For 
definiteness the eigenvalues are enumerated according to their magnitude 
E(k) < E(k) < E(k) < ... and repeated according to their multiplicity. 
En : M* — R is the n-th energy band function. It is continuous on M* when 
viewed as a d-torus. Generically the eigenvalues E£,,(k) are non-degenerate. 
Of course, there may be particular points in k-space where particular energy 
bands touch each other and the corresponding eigenvalue becomes degenerate. 
The normalized eigenfunction corresponding to En(k) is the Bloch function 
and denoted by yn(k) € H?(M). It is determined only up to a k-dependent 
phase factor. A further arbitrariness comes from points where energy bands 
touch. We denote by P,,(k) the projection along y,,(k) and set 


(S> 
P= dkP(k), Hn = P L’ (R*). (2.14) 
M* 


Through the Zak transform we have achieved the product structure 
H=H: 9H, Hs = L?’ (M*), He = L?(M). (2.15) 


Wp E€ Hn is of the form $(k)yn(k, y). The band index n fixes the local pattern of 
the wave function ~ while ¢(k) provides the slow variation. Therefore L?(M*) 
is the Hilbert space of the slow degrees of freedom. On the other hand for 
fixed k, one has oscillations in time determined by the eigenvalues En(k). On 
long time scales, these become fast oscillations. Therefore He = L?(M) is the 
Hilbert space of the fast degrees of freedom. 

Since [P,,, H] = 0, the subspaces Hn are invariant under e~'”*. P,e~i#'P,, 
is unitarily equivalent to multiplication by e~!”"(")! on L?(M*). Note that, in 
general, Hn is not a spectral subspace for H. The band functions generically 
have overlapping ranges. Therefore, if slowly varying terms are added to the 
hamiltonian, the dynamics can no longer be captured so easily by a spectral 
analysis of the perturbed hamiltonian. 
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2.2 The magnetic Bloch hamiltonian 


We consider d = 3. The hamiltonian reads 
1 
H = 5(—#V.—A(e))” + Vr(e), ceER*, (2.16) 


with A(z) = —$BA z, B € R®. Physically the most relevant case is d = 2. It 
is included here by setting z = (z1, 22,0) and B = (0,0, Bo). Following Zak 
[Zak64], see also [DGR02], one introduces the magnetic translations 


(Tors)(2) = (et T+A) (2) = CAM — a) (2.17) 
with a € RÌ. They satisfy the Wey] relations 
TT = 6 OMT ge OTT, (2.18) 
To have a commuting subfamily we need 


Assumption 2. The magnetic field B is such that B- (y AY’) E€ 2nQ for all 
yy EL. 


In the two-dimensional case our assumption requires that the magnetic 
flux per unit cell, Bo - (71 A 72), is a rational multiple of 27. 

Under the Assumption 2 there exists a sublattice Ig C I such that B- (yA 
y’) € 2nZ for every y, y’ € Ip. To is not unique. The set {Ta}acr is a family 
of commuting operators, which commute with H. Since T,7Tg = +Ta+g, the 
magnetic translations still form only a projective group. It becomes a group 
by an even smaller sublattice T} C I such that B- (y Ag’) € 47Z for all 
y, y! € I. Another common choice is a further modification of the phase 
through 





Ta = CPP, (2.19) 


with y(a) = Byaza3+ B3a1a2— B2013. Then TaT = Ta+g for alla, 8 € Ip. 
We can now proceed as in the non-magnetic case. The Zak transform 
becomes 


Cadb)(k,2) = Sr EDT A(x), (yn) ERS. (220) 

velo 
The properties of Uzy) are as in (2.2), (2.3) provided I is replaced by Io, 
and H, is replaced by HZ = {u € L? .(R?, He) : (2.23) below holds true}. In 


particular, H of (2.16) admits the fiber decomposition 


@ 
UzHUz' = dkH(k) (2.21) 
M* 


with M* the first Brillouin zone of Iğ and with the fiber operator 


H(k) = Iiv, + IB Ay th? + VrO). (2.22) 
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The domain of H(k) is independent of k but, in contrast to H(k) from (2.13), 
a function u in the domain has to satisfy the more complicated boundary 
condition 

EW FY OB) u(y — a) = uly). (2.23) 
2.3 Dirac hamiltonian, spin-orbit coupling 
The Dirac hamiltonian with periodic potential reads 


H = B-ia-Vet+Vr(a), (2.24) 


where we have set Me = 1, c= 1. As for the Bloch hamiltonian, H admits the 
fiber decompositon 


® 
H = dkH(k) (2.25) 
M* 
with fiber hamiltonian 
H(k) =B+a-(-iVy+k)+Vr(y). (2.26) 


H(k) acts on L?(M,C*) with periodic boundary conditions (2.5). The free 
Dirac operator has a spectral gap of size 2, in our units, between the elec- 
tron and positron subspace. If we assume ||Vr|| < 1, then this gap persists 
and the eigenvalues can be labelled as Eo(k) < E(k) < ... in the electron 
subspace and as E_1(k) > E_o(k) > ... in the positron subspace. One has 
E_,(k) < Eo(k) for all k € M*. (In fact the labelling can be achieved without 
a restriction on ||Vr||, see [Mau03}). 

For Vr = 0, the eigenvalue E(k) is two-fold degenerate reflecting the spin 
$ of the electron, resp. positron. This degeneracy persists if the periodic po- 
tential is inversion symmetric, see [Mau03] for details. 


Proposition 2.1. Let H be given by (2.24) with ||Vr|| < 1. Let there exist 
a € R? such that Vr(a + a) = Vr(—a% + a). Then each En(k) is at least 
two-fold degenerate. 


Proof. Without loss of generality we may assume a = 0. We use the standard 
basis for the a-matrices, see [Tha94]. In this basis time-reversal symmetry is 
implemented by the anti-unitary operator 


Ty(y) = —iagaiy"(y), (2.27) 
where the complex conjugation is understood component-wise. Using that 
apazay, = —a3a,0%, l = 1,2,3, where @ refers to matrix element-wise com- 


plex conjugation, one checks that 
—iVyaT = —iTVyor, kaeT = —Tkae (2.28) 


and therefore 
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T 1H(k)T = H(-k). (2.29) 
Secondly we use space inversion as 
Ruy) = Bv(-y). (2.30) 


Then 
R-'H(k)R = H(-k). (2.31) 


Combining both symmetries implies 
TOUR RT = H(k). (2:32) 


If H(k)w = Ey, then also RTw is an eigenfunction with the same eigen- 
value. Thus our claim follows from (y, RT) = 0. To verify this identity we 
note that —iaga, = diag (02,02) and (x, Roe2x*) = 0 for an arbitrary two- 
spinor y. 


Corollary 2.2. The eigenvalue E,,(0) of H(0) is at least two-fold degenerate. 


Proof. Since T*H(0)T = H(0) by (2.29) and (4, (—ia3a1)y~*) = 0, the claim 
follows. 


If Vr is not inversion symmetric, generically an energy band is two-fold de- 
generate at k = 0 and then splits into two non-degenerate bands. Note that a 
non-degenerate eigenvalue E,,(k) has an associated eigenvector with a definite 
spin orientation. 

The Pauli equation with spin-orbit coupling has the hamiltonian 


ne =A; DI 7 (VV pla) KENI: (2.33) 


After Zak transform the corresponding fiber hamiltonian becomes 





H(k) = =(-iVy +k)? + Vr(y) 4 Lo- (WVr(y) A (—iVy +4) (2.34) 


with periodic boundary conditions. H of (2.33) is bounded from below. But 
otherwise the band structure is similar to the periodic Dirac operator. Proposi- 
tion 2.1 and Corollary 2.25 hold as stated. In the proof one only has to use the 
appropriate time-reversal operator, which is TY = ooy* in the o3-eigenbasis. 


2.4 Gap condition and smoothness 


Let us consider one of the periodic hamiltonians, Hper, with fiber decompo- 
sition H(k). Hper is adiabatically perturbed to H®. Very crudely the corre- 
sponding unitary groups should be close. To make such a notion quantitative 
a gap condition must be imposed. We denote by o(#) the spectrum of the 
self-adjoint operator H. 
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Gap condition: We distinguish a family of m physically relevant energy 
bands {E,(k), n < j < n+m-— 1} = solk). This family satisfies the gap 
condition if 


dist{oo(k), o(H(k)) \ ao(k)} > g >0 forall k € M*. (2.35) 
We repeat that the gap condition is not a spectral condition for Hper. Let us 
set P? = Arias Pj. 


Under the gap condition the projector P(k) depends smoothly, in many 
cases even (real) analytically, on k. Ran P°(k) is spanned by the basis 
{y;(k)}j=n,....nt+m—1- If the m relevant energy bands have no crossings amongst 
each other, then y; is necessarily an eigenvector of H (k) satisfying H(k)p,;(k)= 
E,(k)y;(k). But if there are band crossings, it can be convenient not to insist 
on v;(k) being an eigenvector of H(k). Thus, while P°(k) is unique, the span- 
ning basis is not. In applications it is of importance to know whether there is at 
least some choice of yj (k), j =n,...,n+m-—1, such that they have a smooth 
k-dependence. Locally, this can be achieved. However, since M™* has the topol- 
ogy of a torus, a global extension might be impossible. In fact this will gener- 
ically happen for the magnetic Bloch hamiltonian, see [DN80, Nov81, Lys85] 
for examples. Somewhat surprisingly, a reasonably general answer has been 
provided only recently [Pan06]. For the case of the Bloch hamiltonian, ana- 
lyticity has been proved before in cases d = 1, m arbitrary, and d arbitrary, 
m = 1, see Nenciu [Nen83, Nen91] and Helffer, Sjostrand [HS89]. They rely 
on analytical techniques. In [Pan06] topological methods are developed. 


Proposition 2.3. In case of the non-magnetic Bloch hamiltonian let either 
d<3,me€EN ord>4, m=1. Then there exists a collection of smooth maps 
R? > k 9;(k) € L?(M), j=n,...,n+m-—1, with the following properties 
(i) the family {yj (k)}j=n,....ntm—1 is orthonormal and spans the range of 
POC). 


(ti) each map is equivariant in the sense that 
pilk) =T(A)p(k +A) for allk CR4, AEA, (2.36) 


where T(X) is multiplication by et^”. The same property holds for the non- 
magnetic periodic Dirac operator and Pauli operator with spin-orbit coupling. 


Remark. The proof uses the first Chern class of the vector bundle whose 
fiber at k is the span of the family {yj(k)}j<n,....ntm—1 i.e. Ran P°(k). To 
establish continuity, and thus smoothness, this first Chern class has to vanish, 
a property, which does not hold for a magnetic Bloch hamiltonian except for 
some particular energy bands. 


If e is small, excitations across the energy gap are difficult to achieve. 
More precisely to P? one can associate a projection operator J€ such that for 
arbitrary @,¢’ € N, 7 € R+, it holds 
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| — We PIT | < coe lre lll (2.37) 


for 0 < t < e-“7 with suitable constants cee (T) independent of £. In other 
words that the subspaces JIH and (1 — I*)H almost decouple, i.e. decouple 
at any prescribed level of precision and over any polynomial length of the time 
span under consideration. For the specific case of the Bloch hamiltonian more 
quantitative details on the decoupling are provided in Sect. 3. 

If the gap condition is not satisfied, the dynamical properties are much 
more model dependent. Firstly the gap condition can be violated in various 
ways. In our context, since H(k) has discrete spectrum, the violation comes 
through band crossings. The behavior close to a band crossing has to be 
studied separately [LT05, BT05]. In other models the energy band sits at the 
bottom of the continuous spectrum of H(k) without gap [Teu02]. Then an 
assertion like Equation (2.37) holds only under a suitable restriction to small 
£, l, usually £,” = 1 or perhaps £ = 2, ” =1. 

The inequality (2.37) makes no assertion about the dynamics inside the 
almost invariant subspace JISH. While there is a general theory available 
[PSTO03a], we prefer to discuss the examples separately in the subsequent 
sections. 


3 Nonmagnetic Bloch hamiltonians: Peierls substitution 
and geometric phase corrections 
We discuss in more detail the effective dynamics for the Schrödinger equation 


with a periodic potential. For concreteness we fix the spatial dimension to be 
3. Under Zak transform the nonmagnetic Bloch hamiltonian becomes 


Uz(5(—iV2—Alen))? + Vr(a) + olen) Ug! =H (8.1) 


with 





HS A E E a EE T (3.2) 


Here Vj, is differentation with respect to k and satisfying the y-dependent 
boundary conditions (2.6). H% is a self-adjoint operator on L2 (RÌ, H?(M)), 
compare with (2.8). 

In (3.2) we observe that the external potentials couple the fibers. To em- 
phasize this feature we think of (3.2) as being obtained through Weyl quan- 
tization from the operator valued function 


AN E a(-i, +k- Alr)? + Vr(y) + elr) (3.3) 


as defined on (r,k) € R and acting on He with fixed domain H?(M), see 
[PSTO3b] for details. In this form one is reminded of the Weyl quantization 
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of the classical hamiltonian function ha(q,p) = $p? + V (q) which yields the 
semiclassical hamiltonian 


|: Liev)? + V(x) (3.4) 
acting in L?(IR°). The analysis of (3.4) yields that on the time-scale e~'t the 
wave packet dynamics governed by Hse well approximates the flow generated 
by ha. In contrast, the adiabatic analysis deals with operator valued symbols, 
as in (3.3), and has as a goal to establish that the dynamics decouples into 
almost invariant subspaces and to determine the approximate dynamics within 
each such subspace. 

To be specific, let us then fix throughout one band index n and let us 
assume that the band energy En is nondegenerate and satisfies the gap condi- 
tion. Therefore we know that En : M* — R is smooth and we can choose the 
family of Bloch functions y,(k), with H(k)yn(k) = En(k)yn(k), such that 
Pn depends smoothly on k. For each ¢ € N = {0,1,...} there exists then an 
orthogonal projection [Jj on H, such that 


LHS, HEll < cee" (3.5) 


for some constants ce. Integrating in time one concludes that the subspaces 
II;H, are almost invariant in the sense that 


Id — per HE TTF] < el + leleeet tte” (3.6) 


for any 4, V € N. Note that the adiabatic time scale, order et, can have any 
power law increase, at the expense of choosing the order of the projection 
IT; sufficiently large. Only for times of order e'/© one observes transitions 
away from the almost invariant subspace. The zeroth order projection is at- 
tached to the n-th band under consideration, while the higher orders are 
successively smaller corrections to J6. To construct I one considers the 
projection onto the n-th band, |yn(k)) (yn (k)|, as an operator valued function 
with values in B(H¢). From it we obtain the minimally substituted projection 
lyn(k—A(r))) (Yn (kK—-A(r))|. Its Weyl quantization is ¢-close to the orthogonal 
projection I. 

The second task is to determine the approximate time-evolution on HF H+. 
Since the subspace changes with €, it is more convenient to unitarily map 
II;H, to an €-independent reference Hilbert space, which in our case is simply 
L?(M*). The dynamics on L?(M*) is governed by an effective hamiltonian. It 
is written down most easily in terms of a hamiltonian function h§ : M* x RÌ > 
R. h§ is a smooth function. We also may regard it as defined on R? x R? and 
I™*-periodic in the first argument. hj admits the power series 


£ 


hg =X hj (3.7) 


j=0 
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with e-independent functions hj. The effective quantum hamiltonian is ob- 
tained from hj through Weyl quantization. The index £ regulates the time 
scale over which the approximation is valid and the size of the allowed error. 

In [PST03b] we provide an iterative algorithm to compute hj. In practice 
only ho and hy can be obtained, at best hə under simplifying assumptions. 
While this may look very restrictive, it turns out that already hı yields novel 
physical effects as compared to hg. Even higher order corrections seem to be 
less significant. 

To lowest order one obtains 


ho(k,r) = En(k — A(r)) + 9(r), (3.8) 
which Weyl-quantizes to 
W* [ho] = En(k — A(ieVe)) + b(ieVe) (3.9) 


acting on L?(M*), where iV, is the operator of differentiation with periodic 
boundary conditions. (The twisted boundary conditions appearing in (3.2) are 
absorbed into the unitary map of HEH to L?(M*).) In solid state physics the 
Weyl quantization (3.9) is referred to as Peierls substitution. (3.8), (3.9) have a 
familiar form. The periodic potential merely changes the kinetic energy $k? of 
a free particle to E,,(k). The main distinctive feature is the periodicity of the 
kinetic energy. For example, in presence of a linear potential ¢, (x) = —E-a, 
an electron, initially at rest, will start to accelerate along E but then turns 
back because of periodicity in k. 
To first order the effective hamiltonian reads 


hi(k,r) = (Vlr) — VEn(k) A B(r)) « An(k) — B(r) : Mn(k), (3.10) 


with the kinetic wave number k = k — A(r). The coefficients A, and Mp are 
the geometric phases. They carry information on the Bloch functions y,(k). 
An is the Berry connection given through 


Anlk) = ilpn(k), Veen (k)) rH (3.11) 


and M,, is the Rammal-Wilkinson phase given trough 
1. 
Mn(k) = si(Veen(k), MAK) — En(k)) Vien (k)) 11; - (3.12) 


The Bloch functions y, are only determined up to a smooth phase a(k), 
i.e. instead of y,(k) one might as well use e~'*") p, (k) with smooth a : M* > 
R. Clearly M,, is independent of the gauge field a. On the other hand, A is 
gauge-dependent while its curl 


Qn = TAN An (3.13) 


is gauge independent. From time-reversal one concludes that 
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Na (=k) = -2p (k). (3.14) 


In particular, in dimension d = 2 for the first Chern number of the Bloch 
vector bundle one obtains 


J dkQn(k) = 0. (3.15) 


For the magnetic Bloch hamiltonian, (3.14) is violated in general, see 
Sect. 4. The integral in (3.15) can take only integer values (in the appropriate 
units) and the first Chern number may be different from zero. Physically this 
leads to the quantization of the Hall current [PST03b, SN99]. 

We still owe the reader precise a statement on the error in the approxi- 
mation. At the moment we work in the representation space at precision level 
L = 1. Let Heg be the Weyl quantization of ho + €hj, see (3.8) and (3.10). 
There is then a unitary U® : 7H, — L?(M*) such that for all y € H, 


I (eF2* — Ue Mente) TIF < ellyl|A + [r[e? (3.16) 


with |t| < e7tr and some constant c independent of ||w||, 7, and e. 


4 Magnetic Bloch hamiltonians: the Hofstadter butterfly 


We turn to a magnetic Bloch hamiltonian in the form (1.4), in dimension d = 2 
and with a transverse constant magnetic field Bo. We want to explain how 
the limits Bo — co and Bo — 0 can be understood with adiabatic methods. 
As a remark, it is worthwhile to recall that, when the physical constants are 
restored, the dimensionless parameter Bo is given by 


BoS 
ae 2rhc/e’ 





(4.1) 


where S is the area of the fundamental cell of I’ and Bo the strength of 
the magnetic field, both expressed in their dimensional units. Thus Bo corre- 
sponds physically to the magnetic flux per unit cell divided by hc/e, as the 
fundamental quantum of magnetic flux. This section is based essentially on 
[FP06], which elaborates on previous related results [Bel86, HS89]. 

Adiabatic limits are always related to separation of time-scales. In the 
present case, one indeed expects that as By — co the cyclotron motion induced 
by Bo is faster than the motion induced by Vr, while in the limit By — 0 the 
microscopic variations of the wave function induced by Vr are expected to be 
faster than the cyclotron motion. 

Let us focus first on the Landau regime Bo — oo. In order to make quan- 
titative the previous claim, one introduces the operators 


Lı = Ti (pı + $Bo 22) , 
eet, (4.2) 
L = Te (p2 — $Bo21) , 
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and the complementary pair of operators 


Gi = 5 (pı — $Bo z2) , 


[Gi, G2] = 5-1 (4.3) 


o 9 


G2 = ge (P2 + 4Bo21) , 
where the relative sign is chosen such that [L;,G,] = 0, for i, j = 1,2. 

If Vr = 0, then Hyp describes a harmonic oscillator, with eigenfunctions 
localized on a scale |Bo|~'/?; this corresponds to the cyclotron motion in 
classical mechanics. Since [G;, Hyp] = 0, the operators Gı and G2 describe 
conserved quantities, which semiclassically correspond to the coordinates of 
the center of the cyclotron motion. 

If Vr Æ 0, but the energy scale ||Vr|| is smaller than the cyclotron en- 
ergy + Bo, then the operators G; have a non-trivial but slow dynamics. By 
introducing the adiabatic parameter 7 = 1/Bo the hamiltonian reads 


1 
Hmg = Di (Li + L3) + Vr (Go = vnLo, —-Gı + viL) : (4.4) 
In view of the commutator [G1, G2] = in1, one can regard 7 Hms as the 
n-Weyl quantization (in the sense of the mapping (q, p) > (G1, G2)) of the 
operator-valued symbol 


1 
hp(q, p) = 5 (Li + La) +nVr (p— J/nl2,-q+ Jnl). (4.5) 
For each fixed (q, p) € R?, hmg (q, p) is an operator acting in the Hilbert space 
Hse = L? (R) corresponding to the fast degrees of freedom. If ||Vr||gc11) < 00, 
then hyrp(q,p) has purely discrete spectrum, with eigenvalues 


1 
An, n (q, p) = (n + 5) + nVr(p, —q) F Oln?) ne N, 


as 7 | 0. The index n € N labels the Landau levels. For 7 small enough, each 
eigenvalue band is separated from the rest of the spectrum by an uniform gap. 
Thus we can apply space-adiabatic perturbation theory to show that the band 
corresponds to an almost-invariant subspace Mn, „L? (R?). Let us focus on a 
specific n € N. One can prove that the dynamics inside Ran Mn, „L? (R?) is de- 
scribed by an effective hamiltonian, which at the first order of approximation 
in 7 reads 


1 
hl = (n+ 5) + 1Vr(Gi, Ga). (4.6) 


The first term in (4.6) is a multiple of the identity, and as such does not 
contribute to the dynamics as far as the expectation values of observables 
are concerned. Leading-order dynamics is thus described by the second term, 
which does not depend on the Landau level n € N. Since Vp is a biperi- 
odic function and (G1, G2) a canonical pair, the second term is a Harper-like 


Motion of Electrons in Adiabatically Perturbed Periodic Structures 611 


operator. The spectrum of such operators exhibit a complex fractal behav- 
ior (Hofstadter butterfly) sensitively depending on the diophantine properties 
of a = £2 (notice that Vr(G1, G2) depends on a through the commutator 
[G1, G2] = iBy'1). The Cantor structure of the spectrum was proven first in 
[BS82] for the case Vr(x1, 22) = A cos x1 + cos x2 (Harper model), for a dense 
set of the parameter values. Later Helffer and Sjostrand accomplished a de- 
tailed semiclassical analysis of the Harper operator [HS89]. As a final step the 
Cantor spectrum has been proven by Puig (A 4 0, a Diophantine) [Pui04], 
and by Avila and Jitomirskaya [AJ05] for all the conjectured values of the 
parameters: À Æ 0, a irrational (the Ten Martini conjecture, as baptized by 
B. Simon). 

Secondly we turn to the opposite limit Bo — 0, where the slow part of the 
dynamics is still described by the magnetic momentum operators L; = v BoLj 
(j = 1,2), with commutator of order O( Bo). However the decomposition given 
by (4.2) and (4.3) is no longer convenient. 

Since Apo is a linear function, Ap(ex) = +€Bo A x, the slow variation limit 
E€ — 0 agrees with the weak field limit By — 0. We then pose £ = Bo and we 
regard Hyp in (1.4) as an adiabatic perturbation of the periodic hamiltonian 
(2.3). Thus we are reduced to the situation described in Sect. 3: to each isolated 
Bloch band of the unperturbed hamiltonian there corresponds a subspace 
IT,,-L?(R*) which is approximately invariant under the dynamics as € | 0. 
The dynamics inside this subspace is described by Peierls substitution (3.9), 
which now reads i 

W* [ho] = En(k — ze A (ieVe)), (4.7) 


as an operator acting in L?(T?,dk). Here Bo = (0,0,£) and e; is the unit 
vector in the i-th direction. 

Formula (4.7) shows that the leading order effective hamiltonian depends 
only on the operators (K1, K2) = K, 


1 
K=k- z N (iEVpk), 


which roughly speaking are the Fourier transform of the pair (Lı, Lə), and 
not on the complementary pair of operators. The same property holds true for 
the effective hamiltonian hĝ, at any order of approximation £ € N, see [FP06], 
with important consequences on the splitting of magnetic subbands at small 
but finite Bo. 

An operator in the form (4.7), shortly written En(K1, K2), is isospectral 
to an Harper-like operator, namely En(G1, G2) acting in L?(R). Indeed the 
first numerical evidence of the butterfly-like Cantor structure of the spectrum 
of Harper-like operators appeared when Hofstadter investigated the spectrum 
of cos Kı + cos Kə as a function of ¢ [Hof76]. On the other side, an operator 
of the form En(Kı, K2) is not unitarily equivalent to the Harper operator 
En(G1, G2). The important geometric and physical consequences of this fact 
are developed in [FP06]. 
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Having explained the two extreme cases, By — 0 and Bo — œ, the reader 
may wonder about the intermediate values of the magnetic field, Bo ~ 1. 
As explained already in Sect. 2.3 it is convenient to introduce the magnetic 
translations 

Ty = € 2 9(%) expliBoa - G), a € Íb, 


see (2.17) and (2.19). If Bo satisfy Assumption 2, then {Tą} is a commutative 
group, thus leading to the magnetic Zak transform (2.20). Hms is then a 
fibered operator over the magnetic Bloch momentum « € T?. At each « the 
spectrum of Hmg(«) is pure point and the corresponding eigenvalues €,2° are 
the magnetic Bloch bands. 

In view of this structure, one might argue that the adiabatic perturbation 
of the hamiltonian which includes, on top of the constant magnetic field Bo, 
a slowly varying magnetic potential A(ex) as in (1.5) can be treated with 
the methods of Sect. 3. There is however one crucial element missing. Indeed 
one can still associate to each magnetic Bloch band €2°, isolated from the 
rest of the spectrum, an almost-invariant subspace Ran MP0. On the other 
side the construction of the effective hamiltonian relies on smoothness which 
may be impeded for topological reasons. Indeed the analogue of Proposition 
2.3 is generically false for magnetic Bloch hamiltonians, as well-understood 
[DN80, Nov81, Lys85]. In geometric terminology this fact is rephrased by say- 
ing that the magnetic Bloch bundle is generically non-trivial (in technical 
sense). This important fact has sometimes been overlooked. For example, As- 
sumption B in [DGR04] is equivalent to the triviality of the magnetic Bloch 
bundle. Under this assumption the magnetic case is already covered by the 
results in [PST03b]. Thus the problem of adiabatic perturbation of a generic 
magnetic Bloch hamiltonian appears to be an open, in our view challenging, 
problem for the future. 


5 Piezoelectricity 


In the year 1880 the brothers Jacques and Pierre Curie discovered that some 
crystalline solids (like quartz, tourmaline, topaz, ...) exhibit a macroscopic 
polarization if the sample is strained. 

It turns out that also this effect can be understood in the framework of 
adiabatically perturbed periodic hamiltonians, cf. [PST06, Lei05]. The per- 
turbation is now slowly in time, 


1 
Hpg(t) = -34r + Vríet) (x, et). (5.1) 


If the potential V(x, et) has no center of inversion, i.e. there is no point with 
respect to which the potential has space-reflection symmetry, then the slow 
variation of the periodic potential is expected to generate a non-zero current 
and can be shown to do so for particular examples [ABL97]. By translation 
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invariance this current if averaged over a unit cell is everywhere the same and 
we denote the average current by J® (t). For the following discussion we assume 
that Vr varies only for times in the finite interval [0, T]. Integrating the current 
per volume over the relevant time interval yields the average polarization, 


T 
aps =f aO 
0 


In this section we discuss results that relate the current J*(t) directly to the 
quantum mechanics of non-interacting particles governed by the hamiltonian 
(5.1), without the detour via the semiclassical model. For this we need to solve 
the Schrédinger equation with initial state p(0) = P(0) being the spectral pro- 
jection of Hpg(0) below the Fermi energy E(0). Since the piezo effect occurs 
only for insulators, we can assume that E(0) lies in a gap of the spectrum 
of Hpp(0) and, in order to simplify the discussion, we also assume that this 
gap does not close in the course of time. Hence there is a continuous function 
E : [0, T] — R such that E(t) lies in a gap of Hpp(t) for all t. The state at 
time t is given by 
p° (t) = U (t, 0) P(0) U= (t, 0)", 


where the unitary propagator U*(t,0) is the solution of the time-dependent 
Schrödinger equation 


d 
ie U*(t,0) = Hpe(t) UF (t,0) with U*(0,0) =1. (5.2) 
With the current operator given by 
[H(t),2] = -7V2 (5.3) 
and the trace per volume defined as 
1 
T(A):= li ——Re Tr(1,4, 4 A 
(A) = lim, GReTL, A), (5.4) 


with 1,4, being the characteristic function of a 3-dimensional box A, with 
finite volume |A,,|, the average current in the state p*(t) is 


Finally the average polarization is 


ape = fae Te), (5.5) 
0 


which is the main quantity of physical interest. The given framework allows 
us to describe the macroscopic polarization of a solid by a pure bulk property, 
i.e. independently of the shape of the sample. 
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In the simplest but most important case (see Paragraph (ii) in Sect. 1 
for a discussion of the model), the periodic potential Vr(x,et) is periodic 
with respect to a time-independent lattice I’. For this case King-Smith and 
Vanderbilt [KSV93] derived a formula for AP based on linear response theory, 
which turned out to make accurate predictions for the polarization of many 
materials. Their by now widely applied formula reads 


J ee 
AP = oar i dk (An(k, T) — An(k,0)), (5.6) 


where the sum runs over all the occupied Bloch bands and A,(k,t) is the 
Berry connection coefficient for the n-th Bloch band at time t € R, 


An(k, t) = i(gn(k, ¢), ViePn(k, t)) L2) $ 


Although An depends on the choice of the Bloch function Yn, the average 
polarization (5.6) defines a gauge invariant quantity, i.e. it is independent of 
the choice of Bloch functions. 

In [PST06] we show that AP® defined in (5.5) approaches AP as given 
by the King-Smith and Vanderbilt formula (5.6) with errors smaller than any 
power of ¢, whenever the latter is well defined. More precisely we show that 
under suitable technical conditions on Vr(t) the average polarization is well 
defined and that for any N € N 


TEE E N 
AP =- ftf dk O(k,t) + OC"), (5.7) 


where 
O(k,t) := —itr (P(k,t)[O,P(k,t), VkP(k,t)]), (5.8) 


and P(k,t) is the Bloch-Floquet fiber decomposition of the spectral projector 
P(t) = 1(~20,6(4)|(Hpn(t)). Whenever all Bloch bands within RanP(k, t) are 
isolated, the explicit term in (5.7) agrees with (5.6). Note however that (5.7) 
is more general, since it can be applied also to situations where band crossings 
occur within the set of occupied bands. 

From the point of view of adiabatic approximation, this result is actually 
quite simple, since one just needs the standard time-adiabatic theory. At time 
t = 0 the state p(0) is just the projection P(0) onto the subspace of the iso- 
lated group of occupied bands. Since these bands remain isolated during time 
evolution, this subspace is adiabatically preserved according to the original 
adiabatic theorem of Kato [Kat50], i.e. 


p(t) = P(t) + Ole), 


and one can compute the higher order corrections to p*(t) using the higher 
order time-adiabatic approximation due to Nenciu [Nen93]. In order to get 
explicit results, one has to do the adiabatic approximation for each fixed 
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k € M* separately. This is possible since Hpg(t, k) is still fibered in k, due 
to translation invariance with respect to a time-independent lattice. However, 
since we need to differentiate with respect to k in order to compute the current, 
as suggested by formula (5.8), the expansion needs to be done uniformly on 
spaces of suitable equivariant functions. This makes the proof more technical 
than expected at first sight. 

Alternatively one can derive also for Hpg(t) the semiclassical equations of 
motion including first order corrections: 


ġ = ViEn(k, t) — € On(k, t), 


5.9 
k=0. A 
And again averaging the velocity over the first Brillouin zone yields the correct 
quantum mechanical average current that is the contribution from the n-th 
band. 

Note the striking similarity between the semiclassical corrections in (5.8) 
and the electromagnetic field. If we define the geometric vector potential 


An(k, t) =i (pn (k, t), VkPn(k, t))L2(m), 


and the geometric scalar potential 


nlk, t) = —i(gn(k, t), 3tPnlk, t))L2(Mm)» 


in terms of the Bloch function Yn (k, t) of some isolated band, then in complete 
analogy to the electromagnetic fields we have 


On(k, t) = —0;An(k, t) — Viebn(k, t), (5.10) 


and 
Nn(k,t) = Vp A Anlk, t). (5.11) 


Time-dependent deformations of a crystal generically also lead to a time- 
dependent periodicity lattice T(t), see (5.1). This more general situation is 
considered in [Lei05, LP06]. Now the lattice momentum k is no longer a con- 
served quantity and the full space-adiabatic perturbation theory is required 
in order to compute the corresponding piezoelectric current. As a result an 
additional term appears in the semiclassical equations of motion, reflecting 
the deformation of the lattice of periodicity. 
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1 Introduction 


This article is concerned with the numerical analysis of dynamical systems 
using methods that are based on a discretized description of the system as 
a graph. The graph-based description provides a unifying framework to ap- 
proach a wide and diverse variety of dynamical systems, from time-discrete 
maps via ordinary differential equations to stochastic differential equations 
describing e. g. diffusion in a potential landscape. 

Within this variety, this article focusses on those dynamical systems that 
can possess a ‘multiscale structure’ in the sense that they exhibit interest- 
ing dynamical behavior on more than one timescale. We will explain what 
we mean with this phrase by means of some examples. Consider in Fig. 1 
one trajectory of Chua’s circuit, that is described by the well-known three- 
dimensional ordinary differential equation 


1 
t = aly — moz — ze’) 
you-yte 
z= —ßy 


(see e. g. [HP*96]). It is clearly visible that relatively long parts of the whole 
trajectory are contained in two ‘leaves’ within which the trajectory shows a 
spiralling motion, with only some quick ‘jumps’ between the two leaves. 

Similar phenomena can be observed in systems of quite different mathe- 
matical type. As an example, consider a stochastic process in R” describing 
diffusion within a potential given by a function V : R” — R. The system is 
given by the Smoluchovski equation 


X(t) =-VV(X(t)) +W (t) 
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Fig. 1.1. A trajectory of Chua’s circuit that switches relatively rarely between two 
almost invariant sets. 


with W; being a standard n-dimensional Wiener process and £ a small param- 
eter. (A variation of this example is described in more detail in Sect. 4.) If we 
assume V(x) — oo for ||x|| > oo (in order to avoid sample paths drifting off 
to infinity), then any sample path will spend most of the time in the vicinity 
of the local minima of V, with transitions between the minima being rare 
events. 

The common feature of the Chua circuit example and of the diffusion 
example is the existence of subsets of the state space that are, although not 
being invariant under the dynamics considered, nevertheless almost invariant 
in the sense that on a short timescale, a change of a trajectory between the 
sets is an event rarely encountered. This suggests to analyze such systems on 
the short timescale as if those almost invariant sets were indeed invariant, 
concentrating on features of the dynamics within the sets, and neglecting 
outside interactions. On the long timescale, on the contrary, the dynamics 
of such systems can be considered as some kind of ‘flipping process’ between 
several almost invariant ‘superstates’. In this view, the first step of an analysis 
that separates different timescales is the identification of almost invariant sets 
in the dynamics, which forms the prime motivation for the work presented in 
this article. 

As was already alluded to in the beginning, we choose the approximation 
of continuous dynamics through discrete Markov chains as the unifying ap- 
proach to various kinds of dynamical systems. Reading a transition matrix 
as the adjacency matrix of a graph naturally transforms the situation into a 
graph theoretic framework. The problem of identifying almost invariant sets 
thus becomes the problem of finding partitions of a graph that are optimal 
with respect to certain cost functions, for which a plethora of solution or 
approximation methods is at hand. 
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The remainder of this article is organized as follows. In Sect. 2, together 
with some notation we introduce the basic concepts used in this work, in par- 
ticular the concept of almost invariant sets that is central for the contents 
of this article. We formulate the problem of identifying almost invariant par- 
titions and then reformulate it first as a discrete optimization problem and 
then as a graph-theoretic problem. Sect. 3 takes up the last formulation and 
introduces algorithmic possibilities graph theory offers for the solution of the 
problem. We pay particular attention to the concept of the congestion of a 
graph with its connection to dynamical systems concepts. Sect. 4 illustrates 
the use of shortest-path-algorithms for the detection of dynamically meaning- 
ful transition paths between almost invariant sets. Here the crucial point is 
the appropriate choice of edge weights in the graph, for which two particular 
examples are presented. 


2 Numerical Analysis of Dynamical Systems 


In this section we introduce the concept of almost invariant sets of a dynamical 
system, and we describe a standard framework for their numerical analysis 
using hierarchical set oriented methods. 


2.1 Dynamical Systems and Invariant Measures 


A map f : X — X on a compact subset X C R” defines a discrete-time 
dynamical system with state space X. Trajectories of the system are sequences 
of points in X of the form 


Tk+1 = f (xz), k=0,1,... : 


A particularly important class of such maps f is that of time-T maps of 
an ordinary differential equation. In this case, under mild assumptions on 
the ODE (local existence and uniqueness) f is even a diffeomorphism; in the 
following we will assume this to be the case. Note that the state space X need 
not be the maximal domain of the map f. In many cases it is more appropriate 
to consider the dynamical system on some (invariant) subset of the maximal 
domain, e. g. an attractor, an ergodic component, the set of non-wandering 
points or the chain recurrent set. 

In this work, we are interested in questions about the global dynamical 
behavior of the dynamical system f : X — X. A powerful approach to these 
questions is to use the transfer operator (or Perron-Frobenius operator) asso- 
ciated with f, which, instead of generating single trajectories of points in X, 
describes the evolution of sets or, more generally, of (signed) measures on X. 
More precisely, the transfer operator associated with f is the linear operator 
P:M =M, 
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(Pv)(S) = v(f~*(S)), S measurable, 


on the space M of signed measures on the Borel o-algebra over X. 
In the following we will assume that yu is an invariant measure for f, that 
is, the probability measure p satisfies 


u(S) = w(f~1(S)) = (Pu)(S) for all measurable S C X , 


and thus is a fixed point of the transfer operator. Moreover we assume that p is 
a unique so-called SRB-measure (Sinai-Ruelle-Bowen) in the sense that this is 
the only invariant measure which is robust under small random perturbations, 
in other words the only physically relevant invariant measure for the dynamical 
system f. 


2.2 Almost Invariant Sets 


For two measurable sets S1, S2 C X we define the transition probability p from 


Sı to Sp as 
(S1 N f~*(S2)) 
(S1) 


whenever u(S1) # 0. The transition probability p(S) := p(S, S) from a set 
S C X into itself is called the invariance ratio of S. If for a number ô € [0,1] 
the relation 


p(S1, S2) := 


p(S) > 


holds, S is called an 6-almost invariant set. In practice, we will be interested 
in numbers ô = 1 — e with 0 < € << 1. When no precise bound 6 on the 
invariance ratio is important, we will also simply speak of almost invariant 
sets. 

The following observation will be crucial for the rest of this article. Let S 
be an 6-almost invariant set, with 6 = 1 — e. From u(S) = u(f—'(S)) one has 
on the one hand that 


u(S) = FTS) = SN FTS) + X\ SAF“ (S)). (21) 
On the other hand, 
MX \ 8) = w(X\ SN FHS) + UXAN SN FX \S)). (22) 
As S is 6-almost invariant, (2.1) means that 
U(X \ SN FT(S)) < eu(S) (2.3) 
which together with (2.2) implies that 


iaon a 8)) > W(X \ $8) — en(S) 
_f{, HS) 
=( g) HENS); 
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and thus (5) 
H 
AN9 (era) 


In short, this means that the complement of an almost invariant set is also 
almost invariant, with the respective invariance ratios being the more similar 
the closer the ratio es is to one. 

This observation naturally motivates the problem of determining a parti- 
tion of X consisting of almost invariant sets of roughly equal weight. For the 
rest of this article we will be concerned with this problem. 

Although it may seem obvious, it is important to note that unlike e. g. 
the decomposition into ergodic components, which is unique for any given 
system, the decomposition of the state space into almost invariant sets will 
in general not be unique. In fact, any small (with respect to e. g. Lebesgue 
measure) variation of an almost invariant set will also be an almost invariant 
set, probably with a slightly different invariance ratio. 

We now formally define the problem of finding a partition of almost invari- 
ant sets in the spatially continuous setting we have been considering so far. 
It will be reformulated twice in the course of this article, first for a spatially 
discretized setting and later in the language of graph theory. 


Problem 2.1. For some fixed p € Nt find a collection of pairwise disjoint 
sets S = {S1,..., Sp} with Uj<j<, Si = X and p(S;) > 0, 1 < i < p, such 
that D 


p(S) := > (5) — max. 


2.3 Discretization of the Transfer Operator 


For the detection and approximation of almost invariant sets we need to ex- 
plicitly deal with the transfer operator. Since an analytical expression for it 
will only be derivable for none but the most simple systems, we need to derive 
a finite-dimensional approximation to it. The following description is based 
on results from e.g. [DH*97, DJ99, DFJ01, DJ02]. 

The basic idea for the discretization is to construct a sufficiently fine cov- 
ering of the state space of the system consisting of boxes, i. e. generalized 
rectangles, by means of a subdivision algorithm as described in [DH97]. The 
basic principle of the subdivision algorithm is as follows. One starts with a 
box Q D X containing the state space. Setting Bo = {Q}, a sequence (Bn) nen 
is iteratively constructed, with each iteration step i — i + 1 of the iteration 
consisting of two parts. In the first part, from the collection B; a new collec- 
tion Bi+ı is constructed by subdividing each box B € 6; along a prescribed 
coordinate axis into two new boxes. In the second part, Bi+ı is constructed 
as the collection of those boxes that do intersect with X, i. e. 


Bur = {Be Bin | BOX #0} 
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There are several modifications of this scheme, in particular regarding the 
choice of boxes to be subdivided. While in the simple scheme every box is 
subdivided in every step, one can reduce the numerical effort by introducing 
an additional selection criterion that decides which boxes to subdivide and 
which ones to leave at the present level. More detailed expositions of the 
subdivision scheme can be found e. g. in [DH97, DJ99, Jun01). 

Of course, in practice one cannot infinitly go on with the construction of 
an arbitrarily fine box covering, but will have to stop the process at some 
level, which results in a covering of the state space X by a finite collection 
B= {B,,..., By} of boxes, i. e. 


b 
Xc|]JB: with m(B,NB;)=0 fori#žj. 


i=l 


Here m denotes Lebesgue measure. 

To discretize the transfer operator, we replace the space M of signed 
measures over the Borel o-algebra by the finite-dimensional space Mg of 
signed measures on the o-algebra that is given by the set of arbitrary unions 
of boxes in B. The standard basis for this vector space is given by those 
measures that assign the weight 1 to precisely one box in 6 and 0 to all other 
boxes. 

With respect to this basis, the discretized transfer operator Pg : Mg —> 
Meg is represented by the matrix of transition probabilities 


m(f7*(Bi) N B3) 


>, I1<ij <b. 2.4 
m(B;) aa 


Pg = (Pij), where Pij = 

In the compution of the transition probabilities pij, the denominator poses 

no problem, as the boxes B; are generalized rectangles. For the computation 

of m(f~!(B;) N B;), that is, the measure of the subset of B; that is mapped 

into B;, there are several possibilities described e. g. in [DFJ01]. A method 
that is often used is the Monte Carlo approach as described in [Hun94]: 


K 
m(F-\(Bi) By) = Doxa, (F(a) 
k=1 


where the 2z,;’s are selected at random in Bj from a uniform distribution. 
Evaluation of yg,(f(v%)) only means that we have to check whether or not 
the point f(xz;,) is contained in B;. There are efficient ways to perform this 
check based on a hierarchical construction and storage of the collection B (see 
[DH97, DH*97]). 

Note that once we have computed an approximation Pg of the transfer 
operator we can obtain a discretized version of the natural invariant measure 
u of the box covering B of A as the eigenvector to the eigenvalue 1 of Pg. 

As described in the beginning of this section, a region will be of interest if it 
is almost invariant in the sense that typical points are mapped into the region 
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itself with high probability. Evidently the infinite dimensional optimization 
problem 2.1 needs to be discretized in order to be treated numerically. To this 
end we again restrict ourselves to subsets that are unions of elements of the 
partition B. Consider the transition matrix Pg from (2.4). Then, our goal in 
the discretized setting is to solve the following problem. 


Problem 2.2 (Boxes). For some p € N* find a collection of pairwise disjoint 
sets S = {51,...,5,} with Ur<i<p Si = B and u(Sk) > 0, 1 <i < p, such 
that A 


—> max. 


p(S) = 


R I= 


sae ) ee > Bi,ByCSe Pij < W(B;) 
a a a 
k=1 Pir Aaea E 


2.4 Graph Formulation 


In this section, we go one step further with reformulating the problem of 
finding almost invariant sets of a dynamical system. As it turns out, the 
optimization problem 2.2 can be translated into the problem of finding an 
optimal cut in a graph. To see this, we first show how the matrix describing 
the discretized transfer operator can also be understood as a matrix describing 
a directed graph, and then show that the quantity p(S) can be naturally 
expressed in terms of edge and vertex weights of the graph. 

As in the previous section, let B be a box covering of X. Let G = (V, E) 
be a graph with vertex set V = B and directed edge set 


E = E(B) = { (B1, Bo) EBxB: f(Bi)NB. #0}. 


The function vw : V — R with vw(B;) = pu(B;) assigns a weight to the 
vertices and the function ew: E — R with ew((B;, B;)) = u(Bi)pji assigns a 
weight to the edges. Furthermore, let 


B = E(B) = {{B1, Bo} € B : (f(B1) N Ba) U (f(B2)N Ba) # OF. 


This defines an undirected graph G = (V, E) with a weight function ew : 
E > R with ew({B;, B;}) = u(B;)pi; + u(Bi)pji on the edges. The difference 
between the graphs G and G is that in G the edge weight between two vertices 
is the sum of the edge weights of the two directed edges between the same 
vertices in G. Thus, the total edge weights of both graphs are identical. 

To formulate the problem of partitioning the state space into almost in- 
variant sets, we will define two cost functions that describe how much weight 
remains within a certain set on the one hand, and how much weight changes 
the set of a partition on the other hand. In order to so, we first need some 
more notation and write u(S) = Jes (Bi) for S C V, and with S= V \ S 
we further denote 


1 
Es,s = 5 u(Bi)pij and Egg = 3 5 H(Bi)pij + u(Bi)Pji- 
i jES iES,jES 
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Definition 2.3. For a set S C V we define 


Ess 
Cin S) = —= 
45) H(S) 
as the internal cost of S, and 
Es 5 
Centl S) = ——— = 
~ WS) 46S) 


as the external cost of S. 


Note that both cost functions are independent from the choice between 
the directed graph G or the undirected graph G. We can therefore choose the 
simpler undirected graph, and we will do that in the following. 


Definition 2.4. For a partition S = {S,...,Sp} of V we define 
iw 
Cine(8) = = XO Cine(S:) (2.5) 
i=l 


as the internal cost of S, and 


_ Lisicj<p SiS; 


Cext(S) = P MEA (2.6) 


as the external cost of S. 


Intuitively, optimal almost invariant partitions have maximal internal cost 
and minimal external cost. Therefore, both the internal and external costs 
are useful cost functions for the problem of computing almost invariant sets. 
However, the minimization of the external cost is not equivalent to the max- 
imization of the internal cost. In fact, the maximization of the internal cost 
favors in general parts that are on average very loosely coupled to the rest 
of the system. However, the size of these parts can in principle become very 
small. On the other hand the minimization of the external cost favors balanced 
weighting of the components. 

It is an easy task to check that p(S) = Cint(S). Thus, the optimization 
of problem 2.2 is identical to the optimization of the internal costs of the 
partition S in equation (2.5) written in graph notation. Therefore, we have 
established the following graph partitioning problem. 


Problem 2.5 (Graph). For some fixed p € N* find a collection of pairwise 
disjoint sets S = {51,..., Sp} with Uj <;<, Si = V and vu(S;) > 0,1 <i<p, 
such that >. 
Cint(S) —> max . (2.7) 
One can also consider the analogous problem for the external cost function. 
Problem 2.6 (Graph). For some fixed p € N* find a collection of pairwise 
disjoint sets S = {51,..., Sp} with UY, <;<, Si = V and vu(S;) > 0,1 <i<p, 
such that a 
Cext(S) —> min . (2.8) 
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3 Computation of Almost Invariant Sets as a Graph 
Partitioning Problem 


In this section we show how existing graph partitioning methods and tools can 
be applied to compute almost invariant sets. We first describe some state of the 
art graph partitioning heuristics. Then, we introduce the notion of congestion 
in a graph and its use in the analysis of dynamical systems, in particular in 
view of the problem of finding a partition of almost invariant sets. Finally, we 
explain how the congestion can be used as a criterion to decide on the number 
of almost invariant sets. 


3.1 Graph Partitioning Heuristics 


In this section we briefly review existing approaches and algorithms for parti- 
tioning the vertex set of a graph. As most variations of the partitioning prob- 
lem — including those we are interested in in this article — are NP-complete, 
the algorithms we present are approximation algorithms that are often based 
on some heuristic for obtaining good partitionings. 

The existing methods and tools for graph partitioning do not exactly op- 
timize the cost functions we introduced in the previous section. We therefore 
give an overview of the most successful graph partitioning methods and im- 
plementations and point out the necessary modifications to such tools. 

For the remainder of this section we assume that we aim to partition a 
the vertex set of a graph into a known number of p parts. In Sect. 3.3 we will 
present a way to identify this number. 

We want to calculate a partition of the vertex set V of a graph G = (V, E) 
into p parts V = S1 U---US, such that one of our cost functions of equa- 
tions (2.5) or (2.6) is optimized. However, the calculation of an optimal solu- 
tion of both cost functions is NP-complete. Another widely discussed parti- 
tioning problem is to minimize the cut size cut(™) = 1) <i<j<p Esi,8; of the 
partition 7 under the constraint that all parts have an equal (or almost equal) 
number of vertices. This problem is sometimes called Balanced Partitioning 
Problem and is NP-complete, even in the simplest case when a graph with 
constant vertex and edge weights is to be partitioned into two parts [GJ79]. 

Efficient graph partitioning strategies have been developed for a number 
of different applications. Efficiency and generalizations of graph partitioning 
methods strongly depend on specific implementations. There are several soft- 
ware libraries, each of which provides a range of different methods. Examples 
are CHACO [HL94], JOSTLE [Wal00], METIS [KK98a], SCOTCH [Pel96] or 
PARTY [Pre98]. The goal of the libraries is to both provide efficient imple- 
mentations and to offer a flexible and universal graph partitioning interface to 
applications. These libraries are designed to create solutions to the balanced 
partitioning problem. 

The tool PARTY has been developed by one of the authors and we have 
used it for partitioning the graphs in this paper. PARTY, like other graph par- 
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titioning tools, follows the Multilevel Paradigm. The multilevel graph parti- 
tioning strategies have been proven to be very powerful approaches to efficient 
graph-partitioning [Bou98, Gup97, HL95, KK98b, KK99, MPDO00, PM*94, 
Pre00]. The efficiency of this paradigm is dominated by two parts: graph 
coarsening and local improvement. 

The graph is coarsened down in several levels until a graph with a suffi- 
ciently small number of vertices is constructed. A single coarsening step be- 
tween two levels can be performed by the use of graph matchings. A matching 
of a graph is a subset of the edges such that each vertex is incident to at most 
one matching edge. A matching of the graph is calculated and the vertices 
incident to a matching edge are contracted to a super-vertex. Experimental 
results reveal that it is important to contract those vertices which are con- 
nected via an edge of a high weight, because it is very likely that this edge 
does not cross between parts in a partition with a low weight of crossing edges. 
PARTY uses a fast approximation algorithm which is able to calculate a good 
matching in linear time [Pre00]. 

PARTY stops the coarsening process when the number of vertices is equal 
to the desired number of parts. Thus, each vertex of the coarse graph is one 
part of the partition. However, it is also possible to stop the coarsening process 
as soon as the number of vertices is sufficiently small. Then, any standard 
graph partitioning method can be used to calculate a partition of the coarse 
graph. 

Finally, the partition of the smallest graph is projected back level-by-level 
to the initial graph. The partition is locally refined on each level. Standard 
methods for local improvement are Kernighan/Lin [KL70] type of algorithms 
with improvement ideas from Fiduccia/Mattheyses [FM82]. An alternative 
local improvement heuristic is the Helpful-Set method [DMP95] which is de- 
rived from a constructive proof of upper bounds on the bisection width of 
regular graphs [HM92, MD97, MPO01). 

As mentioned above, the tools are designed for solving the balanced graph 
partitioning problem. Thus, the optimization criterion is different from our 
cost functions of Sect. 2.4. The coarsening step of the multilevel approach does 
not consider the balancing of the weights of the super-vertices. It is the local 
refinement step which not only improves the partition locally but also balances 
the weights of the parts. Thus, we have to modify the local improvement part 
of the multilevel approach. We therefore modified the Kernighan/Lin imple- 
mentation in PARTY such that it optimizes the cost-function Cint. Overall, 
we use the algorithm of Fig. 3.1 to calculate almost invariant sets. 

As an example to illustrate the partitioning we consider a graph that was 
obtained as the discretization of the dynamics of a pentane molecule that is 
considered in detail in [DH*00]. This molecule has two dihedral angles which 
are used as state space coordinates. The left plot of Fig. 3.2 shows the box 
collection and all transitions between boxes. As we will see in sect. 3.3, it is 
adequate to partition the graph into five or seven parts. These are shown in 
the center and right plots of Fig. 3.2. 
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Partition graph Go = (Vo, Eo) into p parts 
i = 0; 
WHILE (|V:| > p) 
calculate a graph matching M; C Ei; 
use M; to coarse graph G; = (Vi, Ei) to a graph Giga = (Vi+1, Ei+1); 
i:=i +1; 
END WHILE 


let m; be a p-partition of G; such that each vertex is one part; 
WHILE (i > 0) 

i :=i— l; 

use M; to project 7:41 to a p-partition 7; of Gi; 

modify the partition m; on G; locally to optimize Cint(7:); 
END WHILE 
output 70. 





Fig. 3.1. Computing a graph partitioning with the multilevel approach. 
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Fig. 3.2. Partition of graph describing the dynamics Pentane300. Left: The graph 
with all edges. Center: Partition consisting of five parts with Cint = 0.980. Right: 
Partition consisting of seven parts with Cint = 0.963. (See page 695 for a colored 
version of the figure.) 


3.2 Congestion 


Standard graph partitioning methods partition the graph into a predefined 
number of parts. However, if we do not know the resulting number of almost 
invariant sets a priori, we need to find mechanisms which help us to decide on 
a natural number of parts. As we will see, such a mechanism can be devised 
on the basis of the concept of congestion of a graph. 

Intuitively, the congestion of a graph is a quantity that can be used to 
identify ‘bottlenecks’ in the graph, i. e. edges that connect subgraphs which 
have relatively many internal and relatively few external edges. This descrip- 
tion already explains the relevance of the congestion for the problem of finding 
almost invariant sets of a dynamical system. The concept is based on the idea 
of so-called multi-commodity flows on the graph (see e.g. [Lei92, Sin93}). 
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In the following, we first formally define the congestion. As it is often 
not feasibly to precisely compute this quantity for a given graph, we will 
then shortly discuss heuristics for an approximation of the congestion. These 
heuristics will produce upper bounds on the congestion, which we can use in 
a lower bound on the external cost of a bisection of a graph that is discussed 
immediately afterwards. 

The first concept we need for the definition of the congestion is that of a 
single-commodity flow in a graph. Such a flow may be imagined as a way of 
describing the transport of a certain quantity c of some good from a source s 
to a target t through a network of roads that is given by the graph. 


Definition 3.1. Let G = (V,E) be an undirected graph with the vertex set 
V = {i1,...,d}. Let s,t € V be vertices of G with s #4 t, letc Ee R.A 
single-commodity flow f of the commodity c from s tot on G is a function 
f:V xV —R such that 


i. f(v, w) = 0 for all {v,w} € E (flow on edges only), 
ii. f(v, w) = —f(w,v) for all v,w E V (symmetry), 
iii. X f(v,w)=0 for all v € V\{s,t} (flow conservation) and 


wEV ( 
w. > f(s,w) = X flw,t)=c. 


wEV wEV 

As the next step, we will define multi-commodity flows which general- 
ize single-commodity flows. Intuitively, a multi-commodity flow describes the 
transport of certain commodities from every vertex to every vertex of the 
graph. The formal definition is as follows. 


Definition 3.2. Let cs; E R for 1 < s,t < d. A multi-commodity flow F 
of the commodities cs, on G is a function F : V x V x V x V — R such that 
for each pair (s,t) E€ V xV, the function F(s,t,-,-) is a single-commodity flow 
of the commodity cs from s tot on G. 


With these concepts, we are in a position that allows us to introduce the 
congestion of a graph as we are using it in this work. 


Definition 3.3. Let G = (V, E) be an undirected graph with the vertex set 
V = {1,...,d}, vertex weights uł fori € V, and edge weights Aij for {i,j} € 
E. For s,t € V, let cs+ = ut: u$. Denote by F the set of all multi-commodity 
flows of the commodities cs, on G. For an edge {v,w} € E and F € F, the 
edge congestion of {v,w} in F is 


F(s, t, v, w 
cong({v, w}, F) = Zilei (3.1) 


The flow congestion of F on G is 


cong(F) = p cone w}, F), (3.2) 
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and finally the congestion of the graph G is 


G) = mi F). 3.3 
cong(G) = min cong(F) (3.3) 

In principle, other choices of commodities c,; than those used in this 
definition are also possible. However, the choice we made here seems the most 
appropriate for the use we will make of the congestion in this paper, as will 
become clear in the following section. 


Approximation of the congestion 


The computation of the congestion cong(G) of a graph can be costly and is 
often infeasible. We will see in the next section that the congestion can be 
used to bound the external cost of a partition. However, that bound holds for 
the congestion cong(F’) of any multi-commodity flow F. Thus, a sub-optimal 
flow produces a sub-optimal, but still valid bound. In this section we discuss 
heuristics for calculating a flow with a small flow congestion. 

There are some hints of how to construct a flow with a small flow con- 
gestion. Clearly, cycles in the flow should be avoided. Furthermore, it is easy 
to imagine that a low-congestion flow should - at least primarily - go along 
shortest paths between the pairs of vertices. Here, the length of a path is the 
sum of the reciprocal values of the edge weights along the path. 

A straightforward method is to send the flow along shortest paths only. If 
more than one shortest paths exist, the flow value can be split among them. 
If the edge weights are constant, all shortest paths can be calculated in time 
O(|V| - |E|). This can be done by |V| independent Breath-First searches in 
time O(|E]) each. If the edge weights are non-negative, all shortest paths can 
be calculated in time O(|V] - (|V|-log|V| + |E])), e.g. with |V| runs of the 
single-source shortest path Dijkstra algorithm using Fibonacci heaps. We refer 
to [CLR90] for a deeper discussion of shortest paths algorithms. 

A different method is to consider n commodities at a time. For each vertex 
Us, 1 < s < n, consider the commodities cs, 1 < t < n, i.e. all commodi- 
ties with vs as the source. For each source v, we calculate a flow F, which 
transports all commodities from vs to all other vertices, i.e. it replaces n single- 
commodity flows such that F, (v, w) = X}; F(s,t, v, w). Fs is a single-source, 
multiple-destination commodity flow. Definitions (i.) and (ii.) for the single- 
commodity flow (Definition 3.1) remain unchanged whereas the definitions of 
(iii.) and (iv.) are replaced by 

v. >> F(u, w) = —cs for all t # s (from source s to target t) and 
wEV 


n 
vi. SO F3(vs,w) = >> Cst — Cs s (from source s to all targets t except to 
wEV t=1 
source s itself). 


It is left to show how we calculate a single-source flow Fs. We use algo- 
rithms from diffusion load balancing on distributed processor networks for 
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this task, see e.g. [DFM99, EF*99, EMPO0]. Here, the problem is to balance 
the work load in a distributed processor network such that the volume of data 
movement is as low as possible. We use these algorithms in the setting that the 
vertex Vs models a processor with load ies Cs, and all other vertices model 
a processor with no load. Furthermore, the processors are heterogeneous with 
a capacity of cs for processor vs [EMP00]. The diffusion algorithms calculate 
a balancing flow such that each vertex/processor v+ gets a load of cs. That is 
exactly what we need in our context. The resulting balancing flow has a nice 
property: it is minimal in the l2-norm, i.e. the diffusion algorithms minimize 
the value 4/ 1<y,w<n |Fs(v, w)|. This ensures that there are no cycles in any 


flow Fs. Furthermore, the flows are not restricted along shortest paths and 
can avoid high traffic along shortest paths. However, the flows are still fa- 
vored to be along reasonably short paths. Thus, it is expected that the overall 
edge congestion of the resulting flow is reasonably small and that the flow 
congestion is close to the congestion of the graph. 

The PARTY library includes efficient code of a variety of diffusion algo- 
rithms. We use them to calculate the single-source, multiple-destination flows 
for each source s and then add up the values to get the multi-commodity flow. 
Numerical experiments indicate that the resulting flow is indeed very small. 


An Example: Pentane 


To illustrate the meaning of the congestion in the context of dynamical sys- 
tems, we again consider as example the graph describing the dynamics of a 
pentane molecule from [DH*00]. The graph corresponding to this dynamical 
system is shown in the left part of Fig. 3.3. The middle and the right part of 
Fig. 3.3 show the edges with low and with high congestion, respectively. The 
coloring of the boxes indicates the partition into seven almost invariant sets. 
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Fig. 3.3. Congestion of the Pentane. Left: all transitions. Center: only transitions 
with a low congestion. Right: only transitions with a high congestion. (See page 695 
for a colored version of the figure.) 
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It can be observed that — as expected — edges with low congestion can 
mainly be found inside the almost invariant sets. On the other hand edges 
between different almost invariant sets have a large congestion. Thus, a high 
congestion indicates that there are at least two regions in the phase space 
which are only loosely coupled. As we will see in Sect. 3.3, this observation 
is the basis for using the congestion as an identifier for the number of almost 
invariant sets which have to be approximated. 


The congestion bound on Cext 


We will now see how the concept of congestion of a graph can be used for 
the analysis of dynamical systems, in particular for the problem of finding a 
partition into almost invariant sets. 

The congestion can be used to derive a lower bound on Cex¢(.S) for any 
S C I. As before, we use multi-commodities Cs, = vw(vs) - vw(v;) for each 
source Vs € V and each destination v, € V. On the one hand side, for any 
multi-commodity flow with commodities cs,¢ as given above, at least u(S)-u(5) 
‘units’ have to cross the cut between S and S, and as many in the opposite 
direction. On the other hand, with Eg 3 being the sum of edge weights of 
edges crossing the cut, by definition of the congestion, at most cong(G)- Es g 
units can cross the cut. Therefore we have 2- (S) - u(S) < cong(G) - Egg, 
which at once gives the important inequality 


Ess a oy 
u(S)-u(S) © cong(G)’ 


Obviously, a high and tight lower bound can only be achieved with a small 
congestion. Although the congestion can be computed in polynomial time, it 
remains to be very costly. Nevertheless, the congestion can be approximated 
by the congestion of any flow. Heuristics for calculating a small congestion 
were discussed above. Further discussion of lower bounds based on different 
variations of multi-commodity flows can be found in [Sen01]. 


Coxt(S) = (3.4) 


3.3 Identification of the Number of Almost Invariant Sets 


We now discuss the problem of identifying an appropriate number of almost 
invariant sets a given space should be partitioned into. 

Informally, we want to determine a number p € N such that there is a 
partition of V consisting of p parts V = S,U---US> with a high internal cost. 
As the internal cost is monotonically decreasing for the optimal partitions with 
an increasing number of parts p, we are looking for a number p such that an 
(almost) optimal partition into p— 1 parts has an only slightly higher internal 
cost than an (almost) optimal partition consisting of p parts while (almost) 
optimal partitions into p+ 1 parts have a substantially lower internal cost. 
The idea is that if we try to split a compact set (one with a small congestion), 
the internal cost will drop substantially. Thus, our strategy is to start with 
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the whole vertex set as the initial set and keep on bisecting the sets until they 
become compact sets. This leads us to a strategy of how to determine the 
number of compact parts. It can be phrased as a general method: 


Recursively bisect the vertices of the graph until all parts are compact. 


Recursive bisection is a widely used technique in graph partitioning. Al- 
though there are many partitioning methods which directly partition the ver- 
tices of a graph into a number of parts, we cannot apply them here, because 
we do not know the number of parts a priora priori. Aditionally, the graph 
bisection methods are often much more efficient than their generalized coun- 
terparts. 

We have seen in the previous section that the congestion of a graph can be 
used to derive a lower bound on the external cost of a set bisection, i.e. a large 
congestion indicates a large external cost and, therefore, also a small internal 
cost. We use the congestion in order to decide whether a set is compact or not. 
In our experiments we use a threshhold of 5 and say that if the congestion 
is larger than 5 than the set has at least one bottleneck and is not compact. 
Thus, our strategy is to subdivide the parts until all parts have a congestion 
of at most 5. 

One needs to solve two tasks in order to follow the recursive bisection 
strategy and we described both in the previous sections. We use the methods 
described in Sect. 3.1 to recursively calculate bisections of a graph. Further- 
more, we use the congestion in order to indicate whether a part is compact 
or not. 

Figure 3.4 illustrates the recursive bisection process in the partitioning of 
the graph in the pentane molecule example from [DH*00] which we already 
used before. From the top left to the bottom right picture, the levels of the 
recursive procedure are shown rowwise. As we can observe in the first picture, 
the first bisection results in one part of 43 boxes and a very low congestion of 
0.88. However, the other part consisting of 212 boxes has a high congestion 
of 168.82. We continue to bisect parts with a congestion value higher than 5. 
Thus, after a total of 4 bisection levels we get a partition into 7 parts and the 
highest congestion of any part is 3.67 . 


4 Short Paths 


Broadly speaking, the previous section has been concerned with the use of 
graph partitioning algorithms to obtain information about almost invariant 
sets of a dynamical system. In this section, we will consider the use of another 
class of graph algorithms, namely that of shortest-path algorithms, in the 
context of dynamical systems. We will see that such algorithms can be used 
to compute discrete approximations to transition paths of a dynamical system. 
The crucial question for this undertaking is the choice of a weight function that 
defines the length of an edge. We will present two such functions for different 
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Cint = .987 Cint = .983 
43/.13/.977/.88 43/.13/.977/.88 
212/.87/.997/168.82 64/.13/.979/8.50 
148/.74/.993/139.73 
Cint = -970 Cint = .963 
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43/.13/.977/.88 43/.13/.977/.88 
45/.12/.977/.77 45/.12/.977/.77 
19/.01/.927/.06 19/.01/.927/.06 
67/.15/.982/10.18 20/.01/.921/.07 
81/.59/.988/75.12 47/.13/.980/1.09 


43/.12/.979/1.18 
38/.47/.982/3.67 
Fig. 3.4. Recursive bisection of the graph for the pentane example. The values 
indicate: number of boxes / invariant measure / internal cost Cint / congestion of 
subgraph. The graph has 255 vertices and a congestion of 139.67. (See page 696 for 
a colored version of the figure.) 


types of dynamics. Both have a natural motivation, and we will compare the 
results of both approaches. 


Definition 4.1. Let G = (V,E) be any graph with edge weights given by a 
function ew : E — R. A sequence [(v1, v2), (v2, V3),--- (Vi, vi41)] of edges 
(vj, vj+1) E E, 1 < j < i, is called a path from vertex vı to vertex vi41 of 
size i and of length l = Des ew( (vj, vj+1)). A shortest path from a vertex 
vs to a vertex va is a path of minimum length from vs to vg. The distance 
dist(vs, va) from vs to va is the length of a shortest path from vs to va. 


The standard algorithm used for computing shortest paths in graphs is the 
Dijkstra algorithm. It solves the so called Single Source Shortest Path Problem 
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where the shortest paths from one source vertex vs € V to all other vertices 
v € V have to be determined. The Single Source, Single Destination Shortest 
Path Problem is a special case in which only one path from v, to a designated 
destination vertex vg has to be determined. In both cases the runtime of the 
Dijkstra algorithm is O(|V|log(|V|) + ||). For a profound discussion of this 
standard algorithm we refer to e. g. [CLR90], in the following we will only 
roughly sketch its basic principle. 

Given a vertex vs as starting vertex, the algorithm maintains a list of 
distances to vs assigned to every other vertex that is initialised with the value 
co and in the end contains the lengths of the shortest paths from v, to any 
vertex. In the first step, the distances of all neighbors of vs are set to the 
weight of the edge connecting them to vs. These vertices form the initial halo 
set, i.e. they are the vertices for which one path from vs is known but it 
is not known whether this path is a shortest path. In the main loop of the 
algorithm, it removes a vertex Umin With the minimum known distance from 
the halo set, and considers all neighbors of Vmin. If a neighbor is also in the 
halo set, the algorithm checks whether a path through Vmin would result in a 
distance from vs less than the current known distance. If a neighbor is not yet 
in the halo set, it is added to it, with its distance value being the sum of the 
distance of Vmin and the length of the edge connecting the neighbor to Umin. 
The algorithm terminates when a prescribed target vertex is reached or when 
the halo set becomes empty. 

By two slight modifications, the Dijkstra algorithm can be generalized to 
find a shortest path from any vertex of a source set V; C V to any vertex of 
a destination set Va C V. The first modification is that in the initialization 
step all vertices of V; are assigned the distance value 0, and that all neighbors 
of vertices from V, that do not themselves belong to V, form the initial halo 
set. The second modification is that in the main loop, every time a vertex v 
is removed from the halo set, it is checked whether v € Va. 


4.1 Several Short Paths 


For the purposes of this article, the purely graph theoretic consideration of 
shortest paths we have seen until now has to be extended by some ideas 
related to the specialized setting of graphs describing (temporal and/or spa- 
tial) discretizations of continuous dynamical systems. In particular we have in 
mind the fact that the numerical realizations of these graphs necessarily come 
with a discretization error which makes it doubtful whether the notion of the 
shortest path between two vertices vs and vg is really a meaningful quantity 
in our applications — even leaving out the possible existence of several shortest 
paths. Therefore, we are not only interested in one (or all) precisely shortest 
paths, but we are also interested in all paths which are only slightly longer 
than a path with the shortest length. 

For this reason, we want to calculate all paths from a vertex v, to a vertex 
va which have a length of at most (1 + €) dist(vs, va). In order to do so, we 
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need to apply the Dijkstra algorithm only two times. Firstly, we calculate all 
distances from vs to all other vertices and denote these distances by dist (v) 
for all vertices v € V. Among all distances this also includes the distance 
between vs and vg. Secondly, we consider a new graph Gy = (V, F) where 
F consists of the edges in E with direction reversed. Then, we calculate all 
distances from vg to all other vertices in G,, and denote these distances by 
dist2(v) for all vertices v € V. Note that distg(v) is also the distance from v 
to vg in G for any vertex v € V. 

It is now simple to decide whether or not an edge (v;, vj) lies on a path 
between vs and vq of length at most dist(vs, va)(1 + €). Such a path has to 
consist of three parts: a path from vs to vi, the edge (v;,v;) itself and a 
path from vj to vg. The shortest length for the first part is dist;(v;) and the 
shortest length of the last part is diste(v;). Thus, an edge (v;,v,;) lies on a 
path between vs and vg of length at most (1 + €) dist(vs, va) if and only if 


distı (v;i) + ew((ui, v;)) + dista (vj) < (1 + €) dist(vs, va) . 


The result is a subset Esp C E of edges belonging to the short paths. 


4.2 Choices of edge weights 


Until now, we have considered graphs with edge weights ew((v;, v;)) = uł Pji 
that where introduced in Sect. 2.4. While this weighting is appropriate for 
graph partitioning algorithms which aim to minimize the internal cost of a 
partition, it is less useful for shortest path algorithms. 

Instead, we want to use an edge weight such that the length of a path 
((v1, V2), (V2, U3),---, (Vi, Vi+1)) from a vertex vı to a vertex vi+}ı reflects the 
product of the probabilities to choose the next edge along the path, i.e. 
I= Pj41,;- Thus, a high probability to go along this path should be re- 
flected by a short path and vice versa. 

We can do this by using shortest paths algorithms on the graph with edge 
weights 

1 
log(P;:) 


Then the length of a path ((v1, v2), (v2, v3), - - - , (Vi, Vi41)) is 


ew(vi, vj) : = — log(Pji) . 


1= X ew((vj,¥541)) = = loa] [ Pisis) 


j=1 


Note that the product of probabilities on the right hand side of this equation 
is the probability that the Markov chain described by the matrix P produces 
the considered sample path when started in v1. 
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Fig. 4.1. Schematic representation of the example. The rectangular domain of the 
pure diffusion with a reflecting boundary is discretized into 8 boxes. 


Motivational example 


As an example for a type of dynamics for which the edge weight introduced 
in the previous section seems inappropriate, we consider diffusion in a flat 
potential landscape (i. e. with V = 0, see below). We choose a rectangu- 
lar domain and apply reflecting boundary conditions. In Fig. 4.1 we give a 
schematic picture of the situation. Suppose, we start the process in box B. 
From the symmetry of the domain and the nature of diffusion, it is clear that 
the probability to end up in box A is the same as to reach the box C, namely 
0.5. But the particular decomposition of the domain, with the boxes 3, 4 and 
5 having only two thirds of the width of the boxes 1 and2, implies that the 
transition probabilities between boxes 1 and 2 on the one hand side and be- 
tween boxes 3 and 4, and 4 and 5 on the other hand are all equal. This means 
that the discrete path (B,3,4,5,C) is less probable than the path (B,1, 2, A), 
in contradiction to the continuous picture. 


Free Energy 


Another important quantity to characterize the transition behavior of a dy- 
namics in a complex system is the free energy barrier which the dynamics 
has to overcome on its way between two almost invariant sets. Suppose we 
consider the Smoluchowsky dynamics generated by the stochastic differential 
equation 

X(t) = -VV(X(t)) + V/26-1W(t) (4.1) 
where X(t) € R”, W is a standard Browian motion, V : R” — R is a potential 
and 8 is a parameter that is referred to as the inverse temperature. The 
probability to find the equilibrated system in a certain region, say C C R”, 
is given by 


uC) = Z- i exp(—BV (2) )ax (4.2) 


where Z is the normalization factor. The traditional way to define the free 
energy is by means of the marginal density with respect to a given reaction 
coordinate £: R” >œ R 


2(q) =Z f exp(—BV(@)6(€(2) - adr. (4.3) 
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Then the free energy is given by the logarithm of the partition sum Z(q): 
F(q) =—68 *nZ(q). (4.4) 
Discrete free energy 


Now consider a reversible Markov process on a finite state space S = 
{s1,..., Sn} and let 7 = (m,..., Tn) its unique stationary distribution. Anal- 
ogously to the continuous case, we define the free energy in terms of a prob- 
ability distribution 

F(i)=-Inm, >0, ie S. (4.5) 


New weight 


Given two disjoint sets A,B C S, we are interested in the state space path 
which crosses the lowest free energy barriers on its way from A to B. To this 
end, we introduce the new edge weights 


w(i, j) = |F; — Fil. (4.6) 
Let p = (i1,..., is) be a path such that 
Fi, < Fi 


jp È Ti Z Tipo J=1,...,8— 1 (4.7) 


then the length of the path is 
s—1 
Up) = X wii = Fis — Fh. (4.8) 
j=1 


This means that the weight of such a path is simply given by the free energy 
difference between the last and the first state of the path. Moreover, if we fix 
the states 7; and is, then all paths connecting these two states and satisfying 
(4.7), have the same length. Next consider a path p = (i1, ..., in) which can 





be decomposed into two parts pı = (i1,...,%s5) and p2 = (is,..-,%n) such that 
Bere. pa ee seed 
et aa meee ines (4.9) 
Fi; > Fijas J =se ne 
One immediately verifies that the length of such a path is given by 
ip) = 2h, — (Fi, + Fin) = 0. (4.10) 


Again, the length of the path depends only on free energy differences, namely 
the barriers F;, — Fi, and Fi, — F;,,. Consequently, if we fix the states 7; and 
in then the shortest path between 7, and i, w.r.t. to the weights (4.6) is the 
one which crosses the lowest free energy barriers. 
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Fig. 4.2. Schematic representation of the modified example. 


Interpretation 


The first observation is that the new weight allows to find barriers between 
invariant sets. Furthermore, it is more insensitive with respect to the under- 
lying discretization. To explain this issue, let us go back to example in 4.2. 
The probabilities to find the equilibrated dynamics in the boxes 1 and 2 are 
equal, and so are the probabilities of the boxes 3, 4 and 5. That means that 
there are no free energy barriers for the dynamics on its way to the box A 
or C, respectively, conditioned on starting in the box B and thus the paths 
are equal. But this exactly results from the new weight: The lengths of both 
paths are equal, 


1(B,1,2, A) = Fa — Fp = Fo — Fg =((B,3,4,5,C) = const. 


In the previous example the volumes of the boxes are equal. What happens 
if the volumes of the boxes differ? Suppose we decompose the box 4 into two 
boxes with equal volume. In Fig. 4.2 we give a schematic representation of 
the modified example. For this discretization both weights would tell that the 
path (B,1,2,A) is the preferred one since both the transition probabilities 
w.r.t. the box 4 and its stationary distribution decrease. But nevertheless, the 
new weight is more insensitive to the underlying discretization because the 
length of a path does not depend on the entire path but only on the barriers 
which the path overcomes. 


4.3 Modified update step in the Dijkstra algorithm 


The twofold contribution of a barrier can be seen as reflecting the reversibility 
of the process. If it is necessary to know the value of the sum of barriers only 
in one direction then this can be done by modifying the update step in the 
Dijkstra algorithm. Let v be the current node in the main loop of the Dijkstra 
algorithm and let k be a neighbor which has to be updated. Instead of using 
the weight w(v, k) we propose to use the weight w(v,k) = max{0, Fk — Fy} 
for updating the distance of the node k. Doing so, the bidirectional Dijkstra 
algorithm with the modified update-step computes the same paths as the 
unmodified one, but the length of a path only depends on the barrier in one 
direction. 
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Fig. 4.3. Left :Contour plot of the three-hole potential. Right: Equilibrium distri- 
bution at 8 = 1.67. 


4.4 Illustrative example: diffusion in a potential landscape 


In the following example we study the behavior of the bidirectional Dijkstra- 

Algorithm in the presents of two possible transition channels. We compare 

the families of transition paths resulting from the probability weight and the 
free energy weight. For this purpose we choose the three-well potential 

1 

V(a,y) = 3e7" ~ 3" 


Seat aes perl) ay 


L 3e- 0-5)? 


2 





which already has been investigated in [PS*03]. As one can see in the left 
picture of Fig. 4.3 the two deep minima at (—1,0) and (1,0) are connected 
by an upper and a lower channel. We choose the inverse temperature 8 = 
1.67 such that despite the dominance of the two deep minima there is still a 
little probability to find the dynamics in the shallow minimum around (0, 5). 
The dynamical bottlenecks in the upper channel are two saddle points with 
equal potential energy whereas the dynamics in the lower channel only has 
to overcome one saddle point with potential energy higher than that of the 
upper ones. 

The following experiments are based on a discrete realization of the dy- 
namics given in (4.1) for the inverse temperature 3 = 1.67. To be more precise, 
we use the Euler-Maruyama-scheme 


Uni = En — VV (an)T + V2BTIT m, n=0,...,N-1 (4.11) 


to discretize the SDE (4.1) in time, where x, € R?, 7 is the time step and 
mn denotes a realization of a gaussian random variable with mean zero and 
variance one. We choose the time step r = 107° and generate a trajectory of 
total length TN with N = 10°. 
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Fig. 4.4. Left: Stationary distribution of the transition matrix. Right: The free 
energy of the boxes. 


Equidistant discretization 


Next we decompose the rectangular domain into 30 x 30 equidistantly spaced 
boxes. In the left picture of Fig. 4.4 we depict the stationary distribution of the 
reversible transition matrix and in the right picture we plot the corresponding 
free energy. 

In Fig. 4.5 we illustrate the results of bidirectional Dijkstra for both 
weights, the weights which are based on the transition probability and the 
weights (4.6) incorporating the free energy. For all computations we use the 
same sets A and B which consists of boxes covering the two deep minima, 
respectively. In the two columns of Fig. 4.5 we draw the edges which belong 
to the family of shortest paths between the sets A and B. From top to bot- 
tom we increase the parameter e which results in increasing number of edges. 
The left column shows the edges of the most probable paths, whereas in the 
right column we draw the edges of paths which crosses the lowest free energy 
barriers. 

As one can see, both methods detect for small e the lower transition channel 
as the preferred one. But with increasing e the families of transition paths 
differ. The family of transition paths resulting from transition probability 
weight for e = 0.6 includes paths which overcome the big barrier in the middle 
of the potential. Since the probability that the dynamics leaves the basin of 
attraction scales exponentially with the barrier which has to be overcame, 
the paths over the big barrier make no sense. Although the dynamics could 
get trapped in the upper shallow minima, the two lower saddle points rather 
should allow the dynamics to make transition than to go over the big barrier 
in the middle. This behavior is reflected by the free energy weight as can be 
seen in the last picture of the right column. 
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Fig. 4.5. Left column: Family of most probable paths between the set A (left 
minimum) and the set B (right minimum). From the top to the bottom we choose 
e = 0.1, € = 0.3 and e€ = 0.6. Right column: Family of paths which crosses the lowest 
free energy barriers. From the top to the bottom we choose e = 0.01, € = 0.05 and 
e = 0.13. (See page 697 for a colored version of the figure.) 
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Summary. This article is concerned with stochastic differential equations with dis- 
parate temporal scales. We consider cases where the fast mode of the system rarely 
switches from one almost invariant set in its state space to another one such that 
the time scale of the switching is as slow as the slow modes of the system. In such 
cases descriptions for the effective dynamics cannot be derived by means of stan- 
dard averaging schemes. Instead a generalization of averaging, called conditional 
averaging, allows to describe the effective dynamics appropriately. The basic idea of 
conditional averaging is that the fast process can be decomposed into several ’almost 
irreducible’ sub-processes, each of which can be treated by standard averaging and 
corresponds to one metastable or almost invariant state. Rare transitions between 
these states are taken into account by means of an appropriate Markov jump process 
that describes the transitions between the states. The article gives a derivation of 
conditional averaging for a class of systems where the fast process is a diffusion in 
a double well potential. 


1 Introduction 


In complex system modeling, one often finds mathematical models that consist 
of differential equations with different temporal and spatial scales. As a con- 
sequence, mathematical techniques for the elimination of some of the smallest 
scales have achieved considerable attention in the last years; the derivation 
of reduced models by means of averaging techniques [FW84, AKN93, SV85, 
Kif02, Fre78, Kif01, Kif92, BLP78], homogenization techniques [BS97, Bor98, 
BS99], or stochastic modelling [MTV01, Mor65, Zwa73, MTV02] may serve 
as typical links to this discussion. 

This article is concerned with stochastic differential equations where the 
fast mode of the system rarely switches from one almost invariant set in its 
state space to another one such that the time scale of the switching is as 
slow as the slow modes of the system. The basic idea is that the fast process 
then can be decomposed into several ’almost irreducible’ subprocesses, each 
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of which corresponds to one metastable or almost invariant state. To quantify 
this principle, the rare transitions between these states are described by means 
of the expected exit times that can be used to parametrize a Markov chain 
model mimicking the transitions between the states. 


The Averaging Principle 
Let V:R™ x R” — R and consider the SDE 


if = —D,V(2*,y°) + oW (1.1) 
1 Sous 
jf = —- D V (xf, yf = 1.2 
y Ma (x y ) a Je 2 ( ) 
with € > 0 and W; (j = 1,2) standard Brownian motions. If we assume 


ao =<, the above SDE is well-known as the Smoluchowski equation. For e < 1, 
this system consists of a fast variable, y, and a slow one, x. Under suitable 
conditions on V (cf. [FW84]), averaging completely characterizes the limit x° 
of the slow dynamics 2° for e — 0 by an averaged SDE 


4? =— | D,V(2°,y) wro(y)dy + oWi, (1.3) 
R” 


where ug denotes the invariant density of the fast dynamics for fixed x: 


1 2 


poly) = aC ren Ze = eae ae (1.4) 
Ly ¢ n S 


which is assumed for each x to be the unique invariant density. 


Metastable Fast Dynamics & Exit Times 


Let us now assume that the fast dynamics exhibit metastable states , i.e., that 
the effective dynamics in the fast degrees of freedom (DOF) can be described 
by (rare) jumps between these sets, while in between the jumps the dynamics 
remains within one of these metastable subsets. Under this condition averaging 
may fail to reproduce the effective dynamics of the original system, mainly 
for the following reason: The averaging principle is based on the fact that 
the fast DOF completely explore the accessible state space before any change 
in the slow DOF happen; this can fail to hold if metastability is observed in 
the fast dynamics; in particular there is some subset of the accessible state 
space from which the fast motion will most probably exit only on some scale 
of order ord(1) or even larger. Let us make this rigorous by introducing the 
mean exit time for the process y$, from one of the metastable subsets, where 
y$ is governed by the SDE (1.2) for fixed x. If we assume the existence of two 
metastable sets R” = B, U BS with Bẹ N BS = 0, the mean exit time 7$(y) 
from B, is the expected value of the first exit time 7£(y) of the process y$ 
from B, started at y$ (t = 0) = y, which is defined by 
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TH(y) = inf f: ER: f Lpe(yS(s)) ds > 0, y$(0) = ub (1.5) 


Taly) = Tp, (y) 


where D° denotes the complement of the set D. 

Although we would expect that exit times depend on the starting point, 
i.e., y§(0) = y, it can be shown that there do exist subsets D, for which 
the exit time is basically independent for all states y € D. Especially for a 
metastable collection of sets D; of the Smoluchowski dynamics, in the limit 
of vanishing noise intensity we are able to assign a first exit time 7° to an 
entire subset D; rather than to single points y € D;, see [HMS02, SH02, 
SH00]. The question of the asymptotic behaviour of the mean exit time for 
vanishing noise term ¢ has been discussed in detail by, for example, FREIDLIN 
and WENTZELL in [FW84], from which the following result is taken (up to 
some slight modifications tailored to (1.1)&(1.2)) : 


Theorem 1.1 ( [FW84, Thm. 4.1 of Chap. 4], [SH00]). Let the potential 
V(a,-) be twice continuously differentiable, let ymin be one of its local minima, 
and B, a metastable subset with sufficiently smooth boundary OB, containing 
Ymin in its interior, but containing no other local minimum of V(a,-) within 
its interior. Without loss of generality we may assume that V(x, Ymin) = 0. 
Suppose that yo is the unique point on the boundary OB, with 


Vinee = V(x, yo) = min{V (x,y) PYE OB,}. 


The mean exit time 7$ for the process y$ with y$(0) € By then satisfies 
7E 
lim ç’ In a = IVF. 

As we are interested in the case where the averaging principle fails, let us have 
a closer look on the relation between the time scale of the fast motion and the 
exit times from metastable subsets in the fast DOF. The result of the above 
theorem tells us two things: First, rapid mixing of the fast DOF ( 7£ < 1) can 
be realized by fixing ¢ and the potential energy function; then we are always 
able to find an € small enough such that averaging yields a good approximation 
of the effective dynamcis. Second, if we decrease ¢ or increase the potential 
energy barrier, the smallness parameter € has to be chosen exponentially small 
such that the averaged system still is a satisfactory approximation. If we want 
to study the effect of metastabilities in the fast motion, it is natural to relate 
V,,,/<? to e so that the exit times from metastable sets vary on a timescale 
of order ord(1), that is, so that 


2 
T ~ C(aje exp(S Vbar) = ord(1), (1.6) 
where C(x) denotes the subexponential pre-factor that necessarily depends 


on x. Subsequently, the relation symbol ~ denotes asymptotic equality and 
ord is used to indicate comparison to the same order. 
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Conditional Averaging 


The scaling assumption (1.6) on ¢ represents a modeling step which will lead 
towards the derivation of the principle of conditional averaging that may yield 
an appropriate reduced model in cases where the ordinary averaging scheme 
fails: Since we observe rapid sampling of the invariant density jz. in each of the 
metastable subsets, we propose to average over each of these sets alone and to 
couple the resulting systems by a Markovian switching process which describes 
the flipping behaviour between the metastable sets. Then, in the case of (at 
most) two metastable subsets B® and BY) for fixed x, the conditionally 
averaged limit dynamics has the form 


a x? 5 
gz? = - f Davay) a Dy) dy + oW, (1.7) 
1 1 
Ony Ona 
uz’ (y) = — r kly) lpw ly) ey (y) = — z hely) po (yIA-8) 
Hic (BS”) ve B i 


with I(t,x) denoting the Markov chain model with state space S = {1,2}, 
where the rates of the jumps reproduce the transition rates of the original 
system. In [SW*03] explicit values for the generating rate matrix are obtained 
by using the most dominant eigenvalue \{(x) < 0 of the generator of the fast 
dynamics (1.2) together with the weights [ig (BO) of the metastable states on 
the fiber of the fast state space. 


Approach 


The authors of [SW*03] derived the limit dynamics (1.7) in terms of multiscale 
analysis of the Fokker-Planck equation, but there is no rigorous proof. The goal 
of this paper is to obtain a deeper insight into the nature of the conditionally 
averaged system. 

Subsequently we consider the SDE 


é = —D,V (£f, yf) + oW (1.9) 
1 oe 

J = —-D,V(a*,y°) + — W.: 1.10 

y € y (x y ) + Je 2) ( ) 


with € > 0 and W; (j = 1,2) standard Brownian motions. We assume the fast 


dynamics (1.10) to exhibit metastable states B® and BY?) so that the exit 
times from the metastable subsets happen on a time scale of order ord(1) or 
even larger. 

Under these assumptions, we may take advantage of the methodology em- 
ployed to extract the effective dynamics (1.7). This result (each metastable 
subset of the fast dynamics is connected to one averaged equation) motivates 
the idea to construct a system of fast-slow equations which allows for the 
incorporation of temporal fast scale effects in a natural way: the fast motion 
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within one metastable subset is approximated by an irreducible subprocess 
that corresponds to a stochastic differential equation. The result is quanti- 
fied by the parametrization of a Markov chain model I(t,x) that controls 
the switches from one (sub)process to the other according to the transition 
rates between the metastable subsets of the original dynamics. We thus ob- 
tain a stochastic process where the slow variable at each instance is coupled 
to one of two fast variables but where a stochastic switching process controls 
the switches from one fast variable to the other. Then, under appropriate as- 
sumptions on the potential V and for ¢ small, a good approximation of the 
original dynamics (1.9)&(1.10) may be given by 


if = -DV (£f, yf) + oWi (1.11) 


1 e e Ces 
JE = — wUz )) (gE) (yt — mE) (xE —_W. 1.12 
y č wW (x NG m (x )) ag WA 2; ( ) 


where w® (x) denotes curvature of V(a,-) in the potential minima of the 
metastable subsets BS? for i= 1,2, and m® (x) the respective minima. 

A reduced system in the slow variable solely is then obtained by applying 
the well-known averaging results from [apro 6, Kur73, FW84] to each of these 
stochastic differential equations. Denoting u9 UO the (unique) invariant den- 
sity of the process defined by (1.12) for fixed x and I(t, x) = i, the averaged 
system then has the form 


=- | pave x? y) u P )dy + o W, (1.13) 


where the pol denote the invariant densities of the Ornstein-Uhlenbeck 


(OU) processes (1.12) (for these we have explicit expressions). 

That is, we derive a description of the effective dynamics in two steps. In 
a first step we replace the fast dynamics in each of the metastable subsets by 
appropriate OU processes which are coupled to each other by a Markovian 
switching process that reproduces the transition times between BY and BY) 
of the original process. In a second step we simply use the invariant density 
of the OU processes in order to obtain the reduced system (1.13) by means of 
standard averaging. Recalling the conditionally averaged system 


=- | pav V (0°, y) uE (y) dy + oW, (1.14) 


with uD, i = 1,2 defined by (1.8), it is of considerable interest to compare the 
effective dynamics obtained by the two different approaches, namely (1.13) on 
the one hand and (1.14) on the other. Note that the jump processes I that 
corresponds to (1.13) will be derived in a different way than the jump process 
I of (1.14). However, we will see that I and J are comparable in a certain way. 
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2 System under Consideration 


Subsequently we study the SDE (1.1)&(1.2), where the following basic as- 
sumptions about the potential V = V (x,y) are made: 


Assumption 2.1 (i) V € C®(R™+}); 

(ii) V(a,-) is a double-well potential for alla € R™ with two local minima at 
y = m™®,m® and one local maximum at y = yo with mY < yo < m®; 
to point out the dependence on x we will also write m (a), i = 1,2. 

(iii) the position of the saddle point does not depend on x, without loss of 
generality we may assume yo(x) = 0 for every x; 

(iv) the extrema are non-degenerate uniformly in x, i.e., fori = 1,2 


DyyV (a, m©) = w(x) > G6 > 0, DyyV (£, yo) = —wo(x) < —õo < 0. 


Therefore, for fixed x, the particle spends a ’long time’ in one basin (=potential 
well), then quickly undergoes a transition into the other basin, in which it 
spends another ‘long time’, and so on. The condition yo(x) = 0 implies that 
for every x € R” the locations of the two basins do not depend on x such that 
the natural decomposition of the entire state space into metastable subsets is 
simply given by B® U B®), where? 


B® = {(@y eR ™ |y < 0}, B® = {(x,y) eR |y > 0}.(2.1) 


The double-well potentials may serve as toy models mimicking a larger sys- 
tem whose potential energy surface presents several basins corresponding to 
metastable states. 

As outlined in the introduction, we proceed in two steps to derive a reduced 
model for the effective slow variable dynamics. The key point for the first step 
is rooted in the design of V(x,-) which already suggests that an averaging 
procedure should incorporate metastabilities in the fast dynamics that are 
induced by the double-well structure: If the noise level in the fast equation is 
small, the diffusion sample paths of the fast process are located near the local 
minima of the potential wells, and transitions between the two potential wells 
can be considered as rare events. Then, the diffusion can be decomposed into 
two sub-processes 


E uy) = E), y H) gola (t), yO),  i=1,2, 


and a two-state Markov chain I(t, x) mimicking the transitions between B® 
and B?) which happen along the y dynamics and thus depend on the position 
of the slow one. Our approach is based on a quantification of the rates at 


3 In [SW*03], the metastable decomposition for fixed x is defined by the zero z 
of the second eigenfunction uı(x,-) of the fast dynamics generator. It is shown 
in [Wal05] that the zero z of u1(a,-) actually is approximated by the saddle point 
of the potential V(a,-). 
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which the fast process moves between the two subsets on the one hand and, 
on the other, on an appropriate replacement of the almost irreducible fast 
(sub)processes by appropriately chosen OU-processes evolving independently 
of each other. Thus, the most basic questions we have to address concern 
the fast process (1.2) for fixed slow variable x, which is done in Sect. 3. In so 
doing, we basically have to decompose the fast process into the intra-well small 
fluctuations of the diffusion around the potential minima and the inter-well 
dynamics of the diffusion. For both parts we then obtain by means of small 
noise asymptotics basic results that are then picked up in order to assemble 
in Sect. 4 the full dynamics approximation (1.11)&(1.12) including the slow 
variables motion. 

The second step of the approach relies on the small noise approximation 
and is based upon averaging results that can be found in a vast number of 
articles. A simple application of a theorem in [FW84] then provides us in 
Theorem 4.1 with the reduced dynamics (1.13). In the Appendix we show 
how the averaged dynamics can be derived by using multiscale asymptotics 
of the Fokker-Planck equation corresponding to the small noise approxima- 
tion (1.11)&(1.12). In Sect. 5 we compare the averaged dynamics (1.13) to 
the conditionally averaged system (1.7). 

Another important concern of the approach is the relationship between 
the noise level ç in the fast diffusion and the smallness parameter e: The fast 
diffusions inter-well and intra-well approximations are justified for vanishing 
noise ç, so that we suggest ç — 0 to zero as € — 0. Our considerations will 
result in a coupling rule for ç and e that incorporates the asymmetry of the 
double-well potential. In Lemma 3.5 the choice of the noise level ¢ is coupled 
to €e as well as the slow variable x so that the exit times from the metastable 
subsets of the fast dynamics vary on a time scale of ord(1) or larger resulting 
in (1.6). 

Biomolecules operate at ambient temperature and solvent condition, and 
most biomolecular processes can only be understood in a thermodynamical 
context. Therefore, most experiments on biomolecular systems are performed 
under the equilibrium conditions of constant temperature T, particle num- 
ber, and volume. Statistical mechanics tells us that statistical ensembles of 
molecular systems with internal energy V under these circumstances should 
be modelled by the equilibrium density exp(—GV). For the Smoluchowski 
system (1.1)&(1.2) this means to enforce ø = ç (for fixed €), such that ex- 
periments can be arranged with inverse temperature 3 = —2/<¢?. However, if 
the noise intensity ç depends on the slow variable x, it is hardly possible to 
interpret the model system in the context of equilibrium ensembles. Therefore 
our later Lemma 3.5 is not satisfactory as its application removes the system 
under consideration far away from the mathematical modeling of biological 
processes. Thus, the investigations have to be extended to situations where ¢ 
depends on € solely. This will be done in Appendix B. 
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3 Basic Results on Fast Process 


Let Assumption 2.1 be valid in all of the following. For small noise intensity 
S, the process yf corresponding to the Smoluchowski equation (1.2) for fixed 
x is almost decomposable into two subprocesses Yay Y(2)> each attracted to a 
minimum m (x), i = 1,2 of the function V(z,-). 

Thus, we consider the fast motion y$ (t) for fixed slow variables z € R”: 


= Wa, (3.1) 


and distinguish between the two different regions of attraction oS and OP 
where OS” is an open subset of BS” with m® (x) in its interior. The subsets 


ʻE 1 € 


BY and B®) are defined by the potential energy barrier: 
BY = {yeER:y < yol(x)}, and BY = {y: y > yo(a)}, (3.2) 


with yo(x) = 0 denoting the saddle point of the potential V(z,-). 

In the limit of small noise level ¢, Theorem 3.1 below will provide us for 
small ¢ with an approximation of the fast dynamics (3.1) restricted to a single 
metastable set by a simple Ornstein-Uhlenbeck (OU) process mimicking the 
rapid mixing in each of these subsets prior to exiting. There is no information 
in these stationary limits about the possible jumps from the branch y = 
m) (a) to the branch y = m®) (x), or conversely. To address the question of 
the overall behaviour of the stationary state, we will consider in Theorem 3.3 
and Corollary 3.4 below the new discrete-space process on {m' (a), m® (x)} 
assigning information about the inter-well dynamics. 


3.1 Approximation of Intra-well Dynamics 


For vanishing noise intensity ¢, in each of the subsets ow the fast diffusion will 
consist of small fluctuations around the potential minima m® (x) and m) (a), 
respectively. The drift term in (1.2) can now be expanded in a Taylor series 
with respect to y. Taylor-expansion of D,V(z,-) around m (x), i= 1,2 gives 


DyV(a,y) = DyyV (x, m® (a)) (y — m (x)) + O(ly—m (@)|?), (3-3) 
where we have used D} V (x, m® (x)) = 0. For y sufficiently close to m® (a), 
this provides us with an approximation of the SDE (3.1): 

„E 1 (i) € (i) Seer 
You) = =z% (x) You — m'(2)) + et (3.4) 


with w(x), i = 1,2 denoting the curvature of V (a, -) in m (a), see Assump- 
tion 2.1. The solution of the stochastic differential equation (3.4) is known as 
a process of Ornstein-Uhlenbeck type, or OU process for short. To distinguish 
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it from the decoupled’ processes y(,) € oO, i =1,2 that originate from (3.1) 
we denote it Youli) for i = 1,2. We omit the index for the fixed variable x. 
The quality of the approximation will depend on how close the original 
motion stays in the vicinity of the minima m® (x), i = 1,2. This can be made 
more precise by applying the small noise expansion method for stochastic 
differential equations. The basic assumption of asymptotically expanding the 
solution process Ya) for i = 1,2 into powers of the noise intensity ¢ leads to 
a reduction of the equation (3.1) into a sequence of time-dependent OU pro- 
cesses. Mostly the first order is quite adequate and amounts to a linearisation 
of the original equation about the deterministic solution. The reader may refer 
to [Gar85], where it is shown that the procedure yields a convergent power 
series of ¢. Tailored to the approach (3.4), the procedure yields a power series 


ys = Youw + s? RG), 


where the remainder R(¢,t) is the solution of an SDE and stochastically con- 
verges to r(0,t). That is, it exists a limiting SDE with solution R(0, t) such 
that for all T € Rt 


st-lim{ sup |R(c,t) — R(0,t)|} = 0, 
s>0 tejo,T] 


where st- liMmn—>oo En = € denotes limn—oo P{|En — €| > 6} = 0 for every 6 > 0 
and a sequence {&,,} of random variables. 


Theorem 3.1 ( [Gar85, Chapters 6.2, 4.3.7]). Let y$ be given by the 
SDE (3.1) where e€ and x are chosen arbitrary but fixed. Suppose that the 


process starts for some i = 1,2 in an open subset o® of BË containing 
m (x) in its interior, and let You) be the solution of (3.4). Then we have 
for all T € R+ 

lust) — you~(t)| = O?), 
where O is understood as being satisfied with respect to stochastic convergence 
uniformly in t € [0,T] (as s > 0). 


Remark 3.2. As the OU process (3.4) is ergodic, the stationary density io 
is simply given by the Gaussian with mean m (a) and variance ¢?/(2w (x)). 
Aiming at a comparison of pee and u, as defined by (1.8) and (1.4), it is 
shown in [Wal05] that 


lim(up ® —u®)=0 in L'(R). 
T 


Note that we do not get convergence in L°. 
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3.2 Asymptotics of Inter-well Dynamics 


To give a picture of the essential dynamics in the fast state space, we con- 
sider the statistics of the exit times from the metastable sets and approxi- 
mate the transition events of the diffusion by jump times of an associated 
continuous-time, finite state-space Markov chain (the double-well potential 
implies a two-state Markov chain). In principle, one can compute the exit 
times via direct numerical simulation. The approximated exit times can then 
be used to construct a transition rate matrix Q that generates stochastic 
matrices exp(tQ) for all times t > 0. However, the computational effort of 
estimating the expected exit times can be avoided by resorting to the rich 
literature on the derivation of asymptotic formula for the jump times that 
are strongly connected to the dominant spectrum of the corresponding gen- 
erator, see e.g. [Pav02, BGK02, BE*02, HKN04]. Whereas the first papers 
only gave the asymptotic behaviour of the logarithm of expected exit times 
(cf. Theorem 1.1), in [Pav02, BGK02, BE*02] one also finds estimates for the 
prefactor. 

For small noise intensity ¢, transitions between the potential wells occur 
at Kramers’ time that is given up to exponential order by exp((2/s?) AV), 
where AV is the potential barrier that the process must cross to reach the 
other potential well. The first exit time of the Markov process y$ (t) from D 
started at y$(0) = y as defined in (1.5) measures only exits that happen for 
some non-null time interval and depends on the realization of the Markov 
process. 

We are interested in the transition times between the metastable subsets 

BY and BË. If the noise intensity does not vanish, they are not identical 
to the exit times rT Bo? i = 1,2. Instead we have to modify the metastable 
subsets slightly ey that a (small) neighbourhood around the saddle point is 
included, i.e., we consider BS +6 = (—oo, 6) and Bz QO) s= (—ô, oo) instead 
with ô > 0 being a small parameter. Recall that o® € BË ) i = 1,2 are some 
regions of attraction (excluding a neighbourhood around the saddle point and 
including the potential Era that is, m® (x) € € O® fori=1 ,2). Then, the 
first exit times from By +ô and Be ) — ô are basically independent for all 
starting points y € oP and y € o? ; ) respectively. This enables us to assign 
the expected exit times from BP +ô and B® — ô to the entire subsets oP 
and oP rather than to single points. 

In the next theorem, we denote the expected transition times from Bo to 
BY? with i 4 j by T ;(£), i = 1,2. 


Theorem 3.3 ( [Pav02, BGK02]). The metastable inter-well transitions of 

the dynamics (3.1) are given by the following precise asymptotic estimates* 

as ç > 0: 

4 We emphasize again that in the following we will speak of (metastable) transition 
times between BY and B®) or metastable exit times instead of exit times from 
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Tél) = E eop los = T TTO exp (S ZV (e )), (3.5) 
Tal) = E,eo@ltse_4) ay exp (SVQ), (3.6) 
wW T) WoT 


where Vi) (x) = V (x, yo)-V (a, m® (a)) and VË (x) = V(x, yo0)-V (z, m®)(a)) 
denote the left and right potential barriers. 


In [Pav02] the result is obtained in terms of the largest eigenvalue 4 0 of 
the associated infinitesimal generator, which corresponds (apart from suitable 
weights) to the inverse of the expected diffusion exit times. The connection 
will be discussed in the next corollary. 

Our goal is to build a two-state Markov chain and view inter-well tran- 
sitions of the diffusion as simple jumps of this chain. Correspondence be- 
tween the diffusion and the chain will be established by exploiting that exit 
times are asymptotically almost exponential random variables which is shown 
in [SH02, HMS02]. Relying on this fact, we define the jump rates for the reduc- 
ing Markov-chain as the reciprocal of the expected exit times 7;°, ;(x), i 4 j, 
which provides us with the rate matrix Q$ being defined by 


a= (YE Vi), af) en 


The following corollary shows that the invariant density of the reducing 
Markov chain is asymptotically given by the weights over the potential wells. 


Corollary 3.4. Let us denote the (assumed positive and unique) invariant 


density of Q; by W(x) = (Yı (z), b2(a)), that is, 
v(x) Q, =0 with q(x) + Y2(z) = 1. 


Then we find that w(x) is given asymptotically as s —> 0 by (ux (BY), lx (BY), 
explicitly, 


pila) = we(BY), i=1,2. (3.8) 


The rate matrix QS, can be expressed in terms of the invariant density by 
introducing the second eigenvalue A$ (x) of the infinitesimal generator that 
corresponds to the diffusion (3.1). In so doing, we asymptotically obtain 


1 1 


Té olt) = ——————_ and T nle) ~» —————_, 
AS (x) [Mn (BP) A§ (2) | Me (BE?) 


and, conclusively, 


BY, i = 1,2, for the payreproue estimates are given for the mean values of the 
first exit times from BP +ô and BY — ô with ô > 0, where the precise choice of 
the parameter 6 is not important. 
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Siena SBE) pete) 


Proof. To establish (3.8), we simply have to verify 


po(Be’) _ Tle) 


[tg (BY) ~ Tale) ey) 


To this end, we may apply Laplace’s method of asymptotic evaluation of 
integrals depending on the parameter ¢. According to Laplace , we easily get 
the asymptotic estimates in the small noise limit 


(1) 
ey = (3.11) 
Ha x 

w(?) (a 

me exp ( z Ze, m™ (z)) — V (z, m®(2)))) (1+ O(s)), 


and, by using V(x, m® (x)) — V (z, m® (x)) = (V? (x) — VE (x)) together 
with (3.5)&(3.6), we end up with (3.10). The informations about the behaviour 
of A$(x) are again based on the results of PAVLYUKEVICH in [Pav02] who 
derived the asymptotic formula of \{(x) in the small noise limit by expanding 
Aj, into a power series. For asymmetric double-well potential this gives the 
accurate asymptotics for Àf (x) in terms of quantities concerning the shallow 


well of the potential: 


wh) (x)wo (ax 1 
Ao = EO xp (— SH) (1+. 0), 


where we assume without loss of generality 


VO (ae) = min{ VY (2), VE (2)}. 


r bar 


This result has been derived for asymmetric double-well potentials, such that 
the weight on the deep well is approximately 1, that is, ps (BY?) ~ 1. This 
obviously is fulfilled for small values of ç due to pz (BY) —> lass > 0. 
However, to include the case of symmetric double-well potentials (then we 
have uz (BY) = = pial W) = 0.5) we prefer to rewrite the asymptotics of A$ 
according to 

1 Yw) (x)wo(x) 


A$ (£)| us (B®) = = 


€ Qn exp (— Sve )) (1 + O(<))(3.12) 


which allows us due to (3.5) to express the transition rate 1 — 2 asymp- 
totically by [A$ |e (BO). Using the asymptotic estimates (3.11) and (3.12) 
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provides us with an alternative formulation for the asymptotics of Af (x) by 
using the curvature in the deep well (and the weight over the shallow well): 


2T X 


w2) (x)wolx 2 
AE) (BP) = YEO ap (- ENR) (14. O10). 


3.3 Freezing Metastable Transitions 


We complete the analysis of the fast process (3.1) with establishing a rela- 
tionship between the smallness parameter € and the noise level ¢ such that 
the scaling assumption (1.6) is fulfilled. According to Theorem 3.3 this can 
explicitly be realized only if exp(—(2/¢?) AV) scales like e. Here, AV denotes 
the barrier that has to be crossed, that is, AV = vo (x) or AV = V2) (x ). 
A natural way of realizing (1.6) was to rescale the potential energy barrier in 
an appropriate manner (see [Wal05}). 

However, due to the asymptotic investigations in Theorem 3.3 we leave the 
potential untouched and rescale the diffusion ¢ instead. An easy calculation 
leads to the following lemma. 


Lemma 3.5. To freeze the metastable transition times on a time scale t > 
ord(1) for every x as e — 0 it is convenient to set 


CeO ae 
ç = ç(e, £) = (e =a) , K>0 (313) 


Remark 3.6. In Lemma 3.5 we actually have to use the minimum of the two 
barriers uo. V2). Replacing min{ Viar(t )} by Vie) = = (1+ ôV ® for 6 > 


bar 
0 would lead to T$, = ord(e°). According to Corollary 3.4, the need for 
using the minimal barrier is equally expressed by demanding that the second 


eigenvalue à$ (x) asymptotically is part of the dominant spectrum. 


As outlined in Sect. 2, it is of considerable interest to study how to avoid 
coupling of the diffusion ¢ to x and still obtain large time conformational 
changes in the asymptotic limit € — 0. Based upon Lemma 3.5, the fol- 
lowing considerations will lead to meaningful conclusions (a short descrip- 
tion is given in Appendix B) that are strongly connected to results obtained 
by the approach via multiscale asymptotics with disparate transition scales 
(see [Wal05, Chapter 3]): Depending on the noise intensity o in the slow vari- 
able dynamics (1.1), the x trajectory will stay with overwhelming probability 
in a bounded domain D(c) of its state space”; if we choose V3" according 
to the rule 


5 Note that o is not related to s, and we do not demand for small values of ø. 
However, it should be clear that a choice of VS"! could depend on the actual 
size of a. This becomes more clear by considering Figures 6.5 and B.1. Therefore, 


we write D = D(c) for the bounded region. 
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small __ $ (i) 
Via = min{ Vga 


(x) |x € D(o), t= 1,2}, 


and set 


/2 
2V emal 1 
= | —2ar_ K 14 
do = (AB) o Kn (3.14) 


we expect the metastable transitions to happen on a time scale t > ord(1). If 
the potential energy barriers outside the domain D(c) are smaller than Vm", 
the particle will for very small e instantly jump over the barrier once it has 
reached the complement of D(a). Then, the time of the metastable transitions 
will be somehow connected to the expected exit time of the x dynamics from 
D(c). The above idea is justified by rigorously examining the asymptotics of 
the metastable transition times considered in the entire state space. Tailored 
to exemplary situations we outline the procedure in Appendix B. 


4 Derivation of Reduced Dynamics 


We return to the dynamics (1.1)&(1.2) and use the results of the preceding 
section for the design of a small noise approximation of the original process. 
The approximated system is then used in Theorem 4.1 as the basic system for 
the application of standard averaging theorems resulting in the reduced slow 
variable dynamics. 


Small Noise Approximation 
In all of the following let (xf, y€) be the solution of the SDE (1.1)&(1.2). 
Exact jump process. 


As a first step let us introduce the process Î that describes the jumps between 
the metastable sets in y-direction as given by the original dynamics: 


f(t) =141,@ (y‘(). (4.1) 


z(t) 


With this defined, let us denote by (ĉu, Joy) the random process determined 
by 


t = —D,V (x,y) + oW (4.2) 
1 | E Ç ou: 

pe a E) Aa) 23 4 

j= -ECO a) (y mF") + tr, (4.3) 


Now, suppose that the initial points are chosen such that (x*(0), y€(0)) = 
(Zou (0), Hou (0)) = (z, y). 
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According to Theorem 3.1 we obtain for any x,y, T > 0 and e€ > 0 that the 
process (x*(t), y°(t)), t € [0,7] of the original dynamics and the random 





process (%5y(t), Joy (t)) get arbitrarily close to each other for ç — 0. 


However, we will not concentrate on the rigorous mathematical justifica- 
tion of this result, mainly for reasons given in the next paragraph. 


Approximate jump process. 


This result may be very nice. However, it has the crucial disadvantage that 
we will never have the process Î without knowing the actual solution of the 
original process. Therefore, we replace the jump process I by its approximate 
version I as constructed above. Obviously, this will prevent us from being 
able to construct any kind of pathwise convergence. However, it will finally 
allow to construct an approximate dynamics that is explicit in the sense that 
it does not depend on any knowledge about the original process. To this end, 
we denote by (oy, yuy) the random process determined by 


è= —D,V(a,y) + oW (4.4) 
1 ç 

C) — mit) < i. 4.5 

y é w (x) (y m (x)) + Ve 2) ( ) 


with I(t, x) € S = {1,2} denoting the z-dependent Markov chain model whose 
transition rate matrix Q$ = (qi;)i,j is given by its entries 


qi1(2) = —qı2(x), q22(£) = —q21 (£), 
wD (x)wo (a 1 

mela) = Ol) oy Fn), (4.6) 
w) (aw (a 2 

gala) = A ED oy F(a), 


Again, suppose that the initial points are chosen such that (2*(0), y€(0)) = 
(x y(0), ¥u(0)) = (a,y) and I(t = 0,2) = i for (x,y) € BO. 

According to Theorems 3.1 and 3.3 and Corollary 3.4 we can expect for 
any x,y, T > 0 and e > 0 to obtain a good approximation of (x*(t), y€ (t)), t € 
[0, T] by the random process (xOy(t), Yoy(t)) whenever the noise level ç in 
the fast equation is small enough. 

We will call the dynamics (4.4)&(4.5) in the following small noise approxima- 
tion or OU approximated dynamics. 


Averaging 


In Theorem 4.1 we finally arrive at the reduced slow variable system by apply- 
ing standard averaging theorems to the small noise approximation (4.4)&(4.5) 
where the transition rates of the jump process I(t, x) are given by (4.6). 
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Theorem 4.1 ( [FW84, Chapter 7]). Let (coy, yOy) be given by (4.4) (4.5) 
and denote by x°(t) the solution of the differential equation 


= J DV (a, y) LYTE (y) dy + o Wi, (4.7) 


where were is the (unique) invariant density of the process defined by (3.4) 
for fixed x. Then for any T >0 and ç >0 we have 


st-lim{ sup_[eou(t) — #°(t)|} = 0. 
e >" tE[0,T 


Subsequently, we refer to the slow variable dynamics (4.7) as the OU averaged 
dynamics. In Appendix A we use multiscale asymptotics of the corresponding 
Fokker-Planck equation to derive the OU averaged dynamics from the small 
noise approximation. 


5 Comparison to Conditional Averaging 


To complete the discussion and re-establish reference to the conditionally aver- 
aged system (1.7) we finally examine its closeness to the OU averaged dynam- 
ics (4.7). In so doing, we basically compare the behaviour in the asymptotic 
limit ¢ — 0 of 


1. the drift term in (4.7) and (1.7) for fixed F(t, x) = I(t,x) = i; 
2. the corresponding transition chains J and J that control the switches 
between i = 1 andi = 2. 


First, let us consider the transition chain I(t, £) of the conditionally aver- 
aged system (1.7) as given in [SW*03]. There, the transition rates of the jump 
process I are defined by the rate matrix 


x en) | pale) aB) 
0. = nol, bz (BY) = ftg (BS?) ? (5.1) 


where A$ (x) is the second eigenvalue of the infinitesimal generator of the dif- 
fusion (1.2) that is assumed to be of order ord(1). We compare the entries 
in (5.1) to the transition rates qij of the jump process I(t, x) corresponding to 
the OU averaged dynamics that are defined in (4.6). Exploiting the asymptotic 
results of Corollary 3.4 and under a certain additional assumption, the tran- 
sitions rates qj; are asymptotically equal to the rates of Qz. The additional 
assumption that has to be fulfilled concerns the metastable decomposition as 
derived by applying conditional averaging: In [SW*03], the limit dynamics are 
derived by projecting the ensemble dynamics of the original system onto the 
subspace spanned by the dominant spectrum of the infinitesimal generator Ly 
of (1.2). Then, the metastable decomposition BY and BY) will be defined in 
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Fig. 5.1. Asymmetric double-well potential. Any decomposition defined by a point 
zx that is situated between the dashed lines leads to asymptotically wrong results if 
the decomposition is used for the conditionally averaged dynamics. 


terms of the second eigenfunction u1(x,:) of Ls and not by the saddle point 
of the potential V(z,-) as is done in (3.2). Thus, for the definition of Q we 
have to use 
BY = {y: u(a,y) <0} and B®) = {y: us(z,y) > 0}, 

where we can assume that BS is the left subset. It should be clear that the 
zero z of u1 (x, -) must be somewhere between the two potential minima m“) 
and m(?), and in fact, it is only a small step from using results in [Pav02 
to show that z asymptotically (as ç — 0) approaches the saddle point yo(z), 
cf. [Wal05]. The attentive reader may convince himself that it the result is 
of crucial importance, as other choices of the zero z between the potential 
minima may lead to fatal approximation errors (not only for the transition 
rates but also for the drift term), compare illustration in Fig. 5.1. 
_ Having obtained the asymptotic equality as ¢ — 0 of the jump rates of 
I and I, we still have to compare the drift terms in (4.7) and (1.7) for fixed 
I(t, x) = I (t,x) = i. The terms vary for fixed x in the probability density that 
is used to obtain the averaged force on the slow variable x. We apply standard 
Laplacian asymptotics in the limit of vanishing noise ¢ — 0, which provides 
us for i = 1,2 with the precise estimates 





J Pa¥ eu) no W)dy = DaV(e,m(0)) (1409), (6.2) 





J DV eu) OW) dy = DeV(e,m(@)) (1+ 09), 


where the derivative D,V(x,m™ («)) is taken wrt. the first component solely. 

Conclusively, let us suppose that ¢ = ¢(¢) is coupled to e€ by using (3.14). 
Replacing ¢ by ¢(e) in the fast equation (1.2) of the original process will lead 
to a time scale separation of the fast dynamics in y and the metastable transi- 
tions between the potential wells. Then, application of the ordinary averaging 
procedure will destroy the information about slow mixing between the two 
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branches and the result becomes inappropriate to render the effective dynam- 
ics. By contrast, application of Theorem 4.1 does not require to fix ¢: Even 
if ç = ¢(e) due to (3.14), the reduced model (4.7) will represent the effective 
dynamics of (4.4)&(4.5). We get this result because the averaging procedure 
does not affect the Markov chain I(t, x) that stores the distributional infor- 
mation of the metastable transitions. Therefore, by examining the averaged 
system (4.7) as ç — 0, we will obtain the differential equation 


& = D, V (x, m€) (x)) + oW, 


that is considered as the final limit SDE of the original process (1.1)&(1.2) 
with ç = ç(e) given by (3.14) as € > 0. 


6 Numerical Experiments 


In this section we illustrate the results from the preceding section by numerical 
experiments with an appropriate test example. 

We consider the Smoluchowski equation (1.1)&(1.2) where the potential 
for the numerical analysis is given by: 


V(a,y) = 2.5 (y? — 1)? — 0.82y? + 0.00527 + 1.6, (6.1) 
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Fig. 6.2. Left: Potentials V(z,-) in y for different values of x. Right: potential 
barriers Vo (x) (full line) and V2 (x) (dashed line). 
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Fig. 6.3. Typical realization of the original dynamics for ø = 1.0, ç = 0.75 and 
e = 0.0064. Left: trajectories x coordinate; right: trajectories y coordinate. 


which clearly satisfies Assumption 2.1. The potential energy surface is shown 
in Fig. 6.1. At the left hand side of Fig. 6.2 we illustrate the double-well 
potentials V (x, -) for different values of x. The saddle point always is yo(x) = 0 
and takes the value V(a,0) = 4.1 + 0.005 zt, the potential minima are 


m (x) = 0.122 + (-1)'V0.0576 2? +4, i=1,2. 


The right side of Fig. 6.2 shows the potential barriers vio (x) (the left barrier) 
and vye (x) (the right barrier) as functions of x. 

In Fig. 6.3 we show a typical realization of the dynamics (1.1)&(1.2) with 
g = 1.0,¢ = 0.75 and e = 0.0064. For the generation of the trajectories we 
use the Euler-Maryuana scheme with internal time step dt = ¢/100. We 
clearly observe that jumps between the metastable decomposition BC) = 
{(x,y) |y < 0} and B® = {(x,y) |y > 0} induce metastable transitions 
in the x dynamics between x < 0 and x > 0. Comparison with the averaged 
trajectory in Fig. 6.4 reveals inappropriateness of the standard averaging pro- 
cedure (1.3). In Fig. 6.4 right we illustrate the averaged potential V (known as 
Fixman potential or conformational free energy landscape)that is associated 
with the realization at the left: 
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Fig. 6.4. Left: Typical realization of the simply averaged dynamics (1.3) for o = 
1.0, ¢ = 0.75. Right: Fixman potential that corresponds to the trajectory at the left. 
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2 
= S 2 

V(s, £) = 55 in f exp (- aV (a,y)) dy. 

Using standard Laplace asymptotics provides us with the potential in the limit 
¢ — 0 of vanishing fast diffusion 


V(x) = min {V (z, m®(x)) ,V (z, m) (x)) k, 


In Fig. 6.4 we additionally plotted V(x), which graphically is completely iden- 
tical to V (ç = 0.75, £). 

Fig. 6.3&6.4 explicitly visualize the simply averaged dynamics to be inap- 
propriate to render the effective dynamical behaviour of x(t) as €e > 0. For 
small e diffusion in y is very fast compared to diffusion in x. However, the im- 
portant (and only) barriers of the potential are barriers in y direction. Thus, 
for fixed €, decreasing the noise intensity ¢ in the fast equation increases the 
metastability in y. Consequently, by choosing different ç one can analyze the 
effect of increasing metastability on averaging. To this end, it is convenient 
to use the x averaged values of the expected transition rates 1/7;,;(x). As 
detailed in Appendix B this provides us in the asymptotic limit ç — 0 with 
the expected transition times or between the metastable decomposition 
BO U B®) in the (x,y) state space. 

We generated N = 2000 realizations of the original dynamics for € = 
0.0064, o = 1.0 and ¢ = 0.75, 0.7, 0.65, 0.60, and waited for the first exit times 
from B®. The top row in Fig. 6.5 illustrates the location of the trajectories 
z-coordinate right before the transitions occured; the pictures at the bottom 
display the function under the integral in (B.4) (normalized to 1) and nicely 
illustrate that the major contribution to the integral in (B.4) will move right- 


wards as ç — 0, for vio (x) — 0 as x > oo. Comparison of the upper and 


iy ant tes orig arara m E ae eget es hs z up at Pee ae ee 














Fig. 6.5. Top: Transition location (from B® to B®) of the trajectories x- 
coordinate computed by means of N = 2000 realizations of the conditionally av- 
eraged dynamics for ø = 1.0, e = 0.0064 fixed and ¢ = 0.75, 0.7, 0.65, 0.6. Bottom: 
Function under the integral in (B.4) normalized to 1 by using the same parameters 
as above. 
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Table 6.1. Expectation values of transition times from B® to B®) corresponding 
to Fig. 6.5. The values for T,_,2 are obtained by using (B.2) or (B.4). 


= 0H 
mean value from 2000 real. 1240 











the lower pictures reveals almost coincidence between the contribution to the 
integral in (B.4) and the actual location in the x space of the jumps from B® 
to B®). Finally, we compare in Table 6.1 the averaged values of the tran- 
sition times to the numerically obtained values by means of the N = 2000 
realizations. We observe that ¢ has to be chosen small to get closeness. 


Discretization 


The pathwise simulation of the dynamics consisting of the two state Markov 
jump process I(t, x) is developed by using a specific stochastic particle method 
( [HSS01]). To this end, recall the infinitesimal generator Q% = (qj;())i,j that 
allows to calculate the hopping probabilities between the states S = {1,2}. 
The transition matrix P£(x) = (pj,;(T,x)) at time 7 is then obtained by 


P;(a) = exp(rQ;). 


A straightforward calculation reveals 





3 qi2(x) = (afo(«) +484 (2)) 

Pilm, x) = —————__ (1 -e M2 921\7))) (6.2) 
5A qialz) + 451 (2) 
P q5ı (2) -r (afz (2)+451 (2)) 

palra) = EE (1 — rafal) tee), (6.3) 
a G2(x) + 451 (2) 





The entries of Q$, are given in (4.6) by the inverse of the precise estimates of 
the expected transition times over the potential energy barrier in y direction. 

The stochastic particle method requires two steps. We shortly demonstrate 
it for the OU averaged dynamics (4.7). 


Step 1: Transport. The first step consists of determining an updated position 
x(t + dt) by solving 


Ce - [ DV) OYO (y)dy + oW, 


over [0, dt] with initial point x(t). 

Step 2: Exchange. The second step models the exchange between the states 
I(t,z) = 1 and I(t,x) = 2. Thus, if i = 1, we set i = 2 with hopping 
probability pi.2 = p{.(dt, x(t + dt)) and remain at i = 1 with probability 
1 — p42. Vice versa, if i = 2, we set i = 1 with hopping probability pg, = 
p§, (dt, c(t + dt)) and remain at i = 2 with probability 1 — po.;. Return to 
step 1 by setting x(t) = x(t + dt). 


668 J. Walter, Ch. Schtitte 
Parameter Choice 


Subsequently, we choose the noise intensity in the slow equation o = 1 and the 
smallness parameter is € = 0.0064. Trajectories are illustrated with ç = 0.75, 
whereas for comparison of exit times we use different values of ¢. 

Recalling coupling ¢ to € according to (3.14), some words seem to be neces- 
sary concerning the comparison of the full dynamics to the OU approximated 
ones: Without loss of generality we can choose ¢ arbitrary without considering 
the coupling, for the experiments are performed for a fixed value of e. There- 
fore, for fixed € = e* and fixed ç = ¢* we can always find a constant K = K* 
(or a barrier Vsm2!! = ysmall*) such that ¢(e*) = ç* under (3.14). Even if we 
take (3.13) as the basis of our computation, we can desist from the coupling 
rule, for the constant K then can be chosen dependent of x, such that we 
still arrive at ¢(e*,2) = <*. Actually, the postulation of relating ç to € only 
serves as a formal justification of the OU approximation. For the numerical 
implementation only the size of ¢ by its own is of importance, not its relation 
to €. 

The motivation to choose ø = 1.0 and not ø = ç can be infered from 
Fig. 6.5. In case of smaller values of ø, say o = 0.75, the x-coordinate of 
the trajectory will hardly reach the region where the jumps mostly happen. 
Then we had to choose ¢ larger, which on its part would result in a worse 
approximation of the intra-well fast dynamics. We will come back to this 
problem below. 


6.1 Comparison Between Original Dynamics and Small Noise 
Approximation 


Here, we carry out numerical studies in order to compare the Smoluchowski 
dynamics (1.1)&(1.2) with those governed by system (4.4)&(4.5) with fast 
OU processes and transition chain I(t, x) that controls the switches between 
the two OU processes. 

Typical realizations of both the original dynamics and the OU approxi- 
mated ones are shown in Fig. 6.6. The trajectories have been generated using 
the Euler-Maruyama scheme with time step dt = €/100 for both systems. Ap- 
parently, the transition rates between B®) and B®) coincide to some extend 
and the oscillating motion (around the potential minima in y) inbetween the 
transitions seems to be well approximated by using OU processes in the fast 
equation. We clearly observe that jumps induce metastable transitions in the 
x dynamics between x < 0 and x > 0. However, for the trajectories being in 
B) we observe that the x-coordinate of the original dynamics spreads con- 
siderably further rightwards than the x trajectory of the OU approximated 
system (and for the trajectories in B®) the original dynamics «-coordinate 
spreads further leftwards). 

The above observation suggests that the original dynamics have noticeable 
smaller transition times between B® and B®), as the original dynamics more 
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Fig. 6.6. Typical realization of the original dynamics (top) and the approximated 
dynamics with fast OU processes (bottom). At the left we see the x, at the right 
the y coordinate. The realizations have been computed for the same realization of 
the white noise (in the slow and in the fast equation). 


often reaches a domain where the potential barriers (in y direction) are small. 
This is confirmed by Table 6.2, where we computed the expected transition 
times from BM to B®) by means of N = 2000 realizations for two different 
values of ç and ø = 1.0, e = 0.0064 fixed. We come back to this problem in 
the next section where we include the averaged dynamics into the numerical 
examinations. Actually, it will turn out that ¢ has to be chosen very small to 
get perfect coincidence of both the original and the OU approximated system. 


6.2 Results Including Averaged Dynamics 


We now demonstrate pre-eminence of the OU averaged dynamics (4.7). To 
complete the representation we include the conditionally averaged dynam- 
ics (1.7). 

In Fig. 6.7 we compare realizations of the averaged to the full dynamics’ x- 
coordinate. Every trajectory has been computed with the same realization of 
white noise Wi, Wa, such that the internal time step has been set to dt = €/100 


Table 6.2. Exit times from the set B“ for the original dynamics and the OU 
approximated system. 


dynamical model 





OU approximated dynamics 
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Fig. 6.7. Realizations of the original dynamics x coordinate (top, left), the x co- 
ordinate of the (full) OU approximated dynamics (top, right), the OU averaged 
dynamics (bottom, right), and the conditionally averaged system (bottom, left). 


even for the averaged dynamics. The Markov jump process I(t, x) is realized 
by using the same realization of random numbers for every concerned system. 
Concerning the systems with OU processes (full and OU averaged), we observe 
pathwise convergence of the x trajectories, whereas comparison of the orig- 
inal dynamics with the conditionally averaged system reveals distributional 
coincidence. 

In order to present numbers instead of pictures we want to compute the 
expectation values of the metastable transition times from x < 0 to x > 
0 for different values of ç. It is natural to expect that this is realized by 
computing the first exit times from the set S + ô with S = {x € R |z < 0}, 
where ô > 0 has to be large enough to guarantee that the process effectively 
reaches some (small) region of attraction in the complement of S. But Fig. 6.5 
nicely shows that the x-coordinate can spread far into the positive region even 
when it is restricted to the metastable set S. Thus, we suggest to define the 
stopping time as the first exit from B® instead, respectively the first jump 
from I(t, x) = 1 to I(t,x) = 2. At least for ç < 7.5 (compare Fig. 6.6) this is 
equivalent to the metastable transitions from x < 0 to x > 0. From N = 2000 
realizations for € = 0.0064 and o = 1.0 we get a very good agreement between 
the OU approximated dynamics and the OU averaged dynamics, and a good 
agreement between the original and the conditionally averaged dynamics. 

However, there still remains the problem of difference between the OU av- 
eraged and the conditionally averaged dynamics. To overcome the problem, we 
illustrate in Fig. 6.8 the potentials that correspond to the respective trajecto- 
ries for ç = 0.75, 0.60, 0.3. For I(t, x) = i € {1,2} and ç fixed, the conditionally 
(4) 

( 


averaged potential VC, x) and the OU averaged potential ia ¢,x) are 
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Table 6.3. Comparison of exit times from the metastable set S = {x € R |x < 0}. 
For ¢ = 0.65, it was not possible to compute the exit times of the full dynamics’ 
motion within a reasonable period of time. 


dynamical model 
original dynamies | 113 [ 210 [| — | 
conditinally averaged dynamics 1240 








OU approximated dynamics | 136 | 265 | -- | 
OU averaged dynamics 1537 





defined implicitly by 


= OU (i) 


DVO ka) = J Dvema 


DV? (a) = f Davle HW) Ay, 
and we easily show that 


(i) s? 2 
VA = -Sm f exp (— =V(a,y)) dy. 
2 BO S 


Exploiting the estimation method of Laplace we obtain asymptotical identity 
of both potentials: 

lim V” (s,2) = lim VP (s,2) = V(a,m (a). 

[Sas se 


Fig. 6.8 reveals VO(c,2) ~ Vai (s,x) for ¢ < 0.3, whereas they differ 


visibly for ¢ > 0.60 mainly in that region where the jumps from 7 = 1 toi = 2 
mostly happen. This perfectly explains the significant difference concerning 
the transition times in Tables 6.2&6.3. 


A Derivation of Reduced System by Multiscale Analysis 


Here, we show how the averaged system (4.7) can be derived from the sys- 
tem (4.4)&(4.5) with fast OU processes by using multiscale asymptotics wrt. 























averaged potentials for ç=0.75 
averaged potentials for ç=0.60 








b 
D 
xo 


Fig. 6.8. Comparison of conditinally averaged (full line) to OU averaged potentials 
(dashed line). From left to right: ¢ = 0.75, 0.60, 0.3. 
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the smallness parameter e. As the method is applied to ensemble instead of 
single dynamics we have to set up before the necessary requirements concern- 
ing the evolution of probability densities. 

Let us extend the fast-slow system with two OU processes to a finite 
number of OU processes. Thus, we consider the process (toy, You) E R” xR 
that is determined by the following SDE: 


z(t) = —D,V(a,y) + oW, (A.1) 
Ht) = -EoD a) (y= meda) + “Bra, (A) 


where I(t, x) is a right-continuous Markov chain on a probability space taking 
values in a finite state space S = {1,2,...,N} and w(x) takes values in 
R*+ for all i € S. The noise intensity of the fast diffusion may depend on 
x, but is assumed to be strictly positive, that is s(x) > c > 0. To simplify 
notation we perform the asymptotic procedure without a possible dependence 
of ç on the Markov chain I(t,2); a generalization in this direction had no 
effect on the computation. The generator Qy = (qij(£))nxn of the switching 
chain I(t, x) depends on the slow variable x and contains the transition rates 
qij = Gij(L) > 0 from i to j if i #7 while 


g(t) = -$ > ais(2). (A.3) 
tj 
For fixed « € R™ andi € S the diffusion dynamics (A.2) is known as OU 





process and consequently ergodic. The (unique) stationary density gor is 
given by 
; 1 w) (x) (y — mO (x))” 
OU fy) = — (- (i) | AA 
p(y) = Toy ap (-w@ HT), ay 


which is a Gaussian with mean m(x) and variance ¢(x)?/(2w™(x)), and 
thus independent of e. 

The evolution of probability densities p€ € L!(R™+! x S) under the dy- 
namics given by (A.1)&(A.2) is described by the forward Fokker-Planck equa- 
tion. Here, we are working in unweighted function spaces, that is, the density 
p° gives the physical probability to find the system in state (x,y) at time t. 
For later use it may be helpful to slightly change notation for the densities 
p“: The agreement Pia (t, x,y) := p(t, x,y, i) enables us to represent pê as an 
N-dimensional vector according to p€ = (Pi): +) (wy) with pip € IRE), 
Now, the Fokker-Planck equation is regarded on some suitable subspace of 
[1(R™*! x S), and reads 


Op’ = A p$, A = Az + Ay + 27, (A.5) 
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AY o 0 0 aP o o o 

0 A? o o 0 AP o o 
Ar a , Ay = : 

0 05 0 0o 05. 0 

0o o0 oA 0 o0 0 A 


where A” and A‘ are given for f € L1(R™t+) by 


LEY Ay fley) + Dy(w(0)(y—m(a)) f(2.9)) 


2 
A® f(x, y) = Act + Dy (D:V (£, y) f(z, y)). 





A® f(x,y) = 


Note that we actually have to use QT in (A.5), for the rate matrix Q is basi- 
cally considered to be part of the backward Chapman-Kolmogorov equation, 
that is, it describes the evolution of the expectations of functions of the state 
of the system. Consequently, the probability to be in state (x, y) is given by 


(p'(t,2,y),1)s = X vty (t, x,y), 


iES 


(-,-)g denoting the Euclidean inner product in RY. 

Our aim is to average with respect to the fast variable y and obtain an 
averaged equation for the slow variable x alone. To this end, we will use 
multiscale analysis. 


Projection Operator 


We would like to derive an equation for the distribution function in z: 


fe (t,x, y), 1 sdy = > fp Puy(t, £, y) dy, 


iES 


which would be valid in the limit where e becomes very small. To this end, we 
introduce the vector p(t, £) = (Bi), ies PONS) with densities Bj, € D(R™) 
defined by 


Piata) = f vigta y)dy. 


It is expected that an approximate solution of the full dynamics would be 
obtained by multiplying each p(, (t, x) by the stationary distribution ie UO of 
the SDE (A.2) for fixed I(t, x) = i. We formalize this by defining a projection 
operator I = diag(II,..., HO) acting on functions f = (fi,..., fw)? 
LI(R™+! x S) by 


(IH f)(£,y) = diag(uel, ..., p20) JE z,y)d 
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It is obvious that I projects any function into the subspace of all functions 
which can be written in the form 


F= (frea fN), Filey) = Fila) O° (y), (A.6) 


where f; is an arbitrary function of L1(R™), thus f = (f1,...,fy)" 
L!(R™ x S). In the following we study the case where the initial condition 
p*(t = 0,2, y) can be expressed by 


p(t =0,2,y) = (Hp (t = 0))(2, 9). 
However, functions f of the form (A.6) are all solutions of 
Asf = 0, 


that is, the space into which IJ projects is the kernel or nullspace of Az 
expressed by A, JT = 0. Due to the properties of AO considered as an operator 
acting on functions g in y, that is g = g(y) € L1(R), we furthermore have: 


TA, = 0 = Agi. (A.7) 


This is easily seen by introducing the formal adjoint TO of AS), i.e., a dif- 
ferential operator such that for all u € L1(R), v € L® (or u,v € L?(R)) we 
have 


(AP u, vjr: = (u, Tv) p2, (u,v) 12 = [ wy) u(y) dy. 


If we consider IJ“ — for fixed x — as an operator acting on functions in y, we 
can rewrite it by l : 
Tu = (u, 1)z2 y9Y®. 


Together with the well-known fact that T1 =0 (see, e.g., [SHDO1, Hui01}) 
we finally get the desired result (A.7). 


Multiscale Analysis 


We now make the following ansatz for the solution of the Fokker-Planck equa- 
tion with the initial conditions described above: 
p= p Hep Hep +t... 
This ansatz is inserted into the Fokker-Planck equation (A.5) and then, by 
comparison of coefficients of different powers of € we get: 
el: Asp =0 (A.8) 
e: Asp! + (Ay + Q7)p° = &p° (A.9) 
el: Arp + (Ay +Q7)p' = pt (A.10) 
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Step 1: (A.8) immediately yields that p? € N(Az), i.e., 


Ip? = p®, equivalently (A.11) 
Peay) = diag (Wee, «pe PO D(t, £), 


for a function p? € L1(R™ x S) depending only on z. 


Step 2: Let JI act on (A.9) and use (A.7). This time we get: 


IT(Ay + Q7) Ip = OTT p°. (A.12) 
By using (A.11) simple calculations reveal for p° = (Pl) ---, Ply)”: 
Op = (A+ Q7) P, (A.13) 
with ; 
Ao 
=F (2) 
= 0 A 
A= i ; 
0 0 » 0 
G: Wk Oa,” 


i o? i 
A? = Fa, + Da f Dave Ow: ), 


A acting on L'(R™ x S). Thus p° is determined by a Fokker-Planck equation, 
and its solution gives us pf up to error O(e). The associated SDE is given by 


t= - f Daves) gT (y)dy + oW, (A.14) 


with solution process x°(t) where I(t, £x) € S controls the switches between 
the different OU processes due to the rate matrix Q = Q,. The SDE (A.14) 
describes the limit dynamics of (A.1)&(A.2) in the sense that its solution 
satisfies jy — x° as € — 0 either pathwise [FW84], or in the distributional 
sense [Kur73, MTV99]. 


B Asymptotics of Transition Times 


Here, we come back to the problem addressed in Sect. 3.3. In order to avoid 
coupling ¢ to the slow variable dynamics x we relax the postulation T;£, ;(x) > 
ord(1), i 4 j in Lemma 3.5 that is required for every x and i = 1,2. Instead 
of considering the transition times on every fibre of the fast state space for 
fixed x, we introduce the expected transition times Tie and Toci between 
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the metastable decomposition B® UB) in the entire (x,y) state space. This 
enables us to identify large time conformational changes with the stipulation 


= 


Tio, Tai > ord(1). 


We obviously have 


=e 


Té (x) 2 ord(1) = Tino Taes 2 ord(1), 


i>j 
whereas the other direction need not to be valid. 

With these preparations we claim the following: If we define the relation- 
ship between ¢ and e by 


small \ 1/2 
s(e) = (Z) ; (B.1) 


Vemall — min{V® (x) |z € D(a), i= 1,2}, 
where D(c) is some appropriately chosen bounded connected domain? of the 
slow variable state space, the metastable transitions Taso; Toci are of order 
one or even larger. 

In what follows we show how to compute Tij which is strongly con- 
nected to the asymptotic order of the transition times 7;%,; (x) on every fiber. 
We will consider two possible situations that are exemplary for the different 
approaches. We first examine the consequences of the asymptotic order of 
T;_,;(x) in general, and afterwards relate the results to the functions vi (x) 
and a coupling ¢ = ¢(e) given by (B.1). 

In Theorem B.1 below we assume the transition times 7;<, ;(x) to asymp- 
totically go to infinity, where we do not specify wherefrom the asymptotic 
investigations come from, that is, we leave open which parameter causes the 
asymptotic behaviour. Thus, possible (and reasonable) choices were ç — 0 
and € fixed, € — 0 so that ç(e) — 0, or, not less supposable, we could as- 
sume a scaling of the potential barrier. The next result becomes apparent 
in [Wal05, Chapter 3], where the approach is justified by means of multi- 
scale analysis for distinguished time scales. There the metastable transitions 
are assumed to happen on the longest time scale, which requires the averag- 
ing of the metastable transition rates (represented by the second eigenvalue 
of the corresponding generator) for fixed x wrt. the invariant density of the 


conditionally averaged potentials. 


Theorem B.1 ( [Wal05, Chapter 3.3.3]). Suppose T£,;(£) — œ almost 
everywhere for i,j = 1,2 andi 4 j. Then the metastable transition times ome 
are basically independent of the starting point and are asymptotically derived 


by means of averaging the x-dependent transition rates” against the invariant 


ê See explanation in Sect. 3.3. 
T Note, that we actually have to average the transition rates and not the transition 
times. 


Conditional Averaging Revisited 677 


probability distribution of the x dynamics conditioned upon remaining within 
the metastable set B® and taking the inverse of the averaged transition rates, 
that is, 
= 1 
Ff eae a 
Ezo [1/7 )] 


where the quantity Ezo [1/7 ;(x)] is given by 
BolT] = | TER” (e), (B.2) 
; 2 2 2 
p® (£) = ye (-3( — Sn Ds exp (— SV (eeu) ) .  (B.3) 


Here, Z denotes the normalization constant and depends on ¢ as well. 
Define the jump process I(t) by its transition rates TT s Then we find that 
the random process (%oy,Yoy) determined by system (4.4)B (4.5) is asymp- 
totically given by the SDE 


= -D zV (x y) F oW 
ja dat (2) (ym I(x) + SW 


In the limit of small noise ¢ the evaluation of the expression (B.2) asymptot- 
ically reduces to 


oe ioj) > (B.4) 


[uy (x x)w ; 
DO ee = p-ni (2 Nexp(-V e mO (x))) dz. 





LO 


Proof. We only have to show (B.4). The rest is verified in [Wal05]. First, we 
consider the averaged density ji (x): Using standard Laplacian asymptotics, 
we get for ¢ small 


ies exp (— SV (w,y))ay =g moe) exp (— SV (w,m(x))) (1+0), 


and, exploiting (s?/2) In(¢\/7/w(x)) —> 0 as ç > 0, we end up with the 
asymptotic limit (from aie 


(i) 
H (x yes a0 


Together with Theorem 3.3 we immediately obtain (B.4). 


exp (— SVU, m® (x))). 


Remark B.2. If we consider the asymptotics of the transition times for van- 
ishing ¢ and e€ fixed, it is easily seen that the assumptions of Theorem B.1 are 
fulfilled. However, if we consider the asymptotic limit for « — 0 and ¢ = ¢(e) 
as given by (B.1), the behaviour of 7;£,;(x) will depend on the course of the 


Ij 
functions VĆ a PE 12s 


bar 
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Fig. B.1. Exemplary possibilities for the functions VO (x), i = 1,2. In the left 
picture we find that the left and right potential barriers are bounded away from 
zero by a positive constant. This is prevented in the picture at the right, for vi 


bar 
converges to zero. 


In the situation illustrated in the left panel of Fig. B.1 we may apply the 
approach in Theorem for sure, if the relationship between ¢ and e€ is defined 
in an appropriate way. This is formulated in the next corollary. 


Corollary B.3. Suppose that inf {V? (x) |z eR, i=1,2} = Vem > 0 and 


bar 
vie > Val almost everywhere. Let us define the small noise intensity ¢ by 
/2 
OV nmal 1 
= | —2- K>0. B.5 
(= (FS) KS (B.5) 


Then, we obtain in the asymptotic limit € — 0 
= = 
Tin ( [1/78 j@) (a) az) 


Assume in addition that vo attains its smallest value and let the minimum 
of vi occur at, say, xo. Moreover, we assume vi > vi (x0) > VER! for 


all x # xo and Drs VH (xo) #0. Then 


Ezo [1/T2(#)] ~ (B.6) 


wC) (%9)wo(20) 1 2 1 € 2 (1) 
— m yo OXP(-GV(x0,m™ (a0) = exp(— -< Voar (20)). 
Aroa (ao) 20 a 


For 6 > 0 so that Vi) (x0) = (14+.6)Vsm"" we finally obtain 


bar 


Ti = ord(e~*s/In(1/e)). (B.7) 


Proof. The first part immediately follows from Theorem B.1, for we have 
almost everywhere 
v(a) = 14+ 6M) vee", 6 >0 = (e) = ord(e7®). 
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(B.6) follows from (B.4) by using Laplace’s method in the limit of vanishing 
noise ¢. With the assumed coupling of ¢ according to (B.5), we then obtain 


from (B.6) 
E,@[1/T_.2(2)] = ord(e~°/In(1/e)). 


One could also contemplate a situation, such as that schematically indi- 
cated at the right-hand side of Fig. B.1. Here, VO (x) — 0 as x — ov, and 


bar 
there is no local minimum V8™"" such that Vsm2l! < Ave. (x) for all x. Access 
to this problem is established in the next proposition where the argumenta- 
tion has to be carried out rather intuitively. As outlined in Remark B.2, in 
Proposition B.4 it is coercive to consider the asymptotic behaviour as € — 0 
together with a reasonable coupling of vi (x)/<s? that is not yet specified. 


r 


Proposition B.4. Suppose that min{ TE (x£), T4, (£)} — 0 asymptotically 
forx € D where D is some subset of positive Lebesgue measure. We define a 
decomposition of D = Dı U Də by 


Dı = {x € D | min{T_2(@), T1 (2)} = T2(2)}, 
Dz = {x € D | min{ Té a(z), T1 (#)} = Tz (w)}- 


To simplify argumentation, we assume that Di, i = 1,2 are connected sub- 
sets of D and (Dı N D2)\ô(Dı N D2) = Ø. Moreover, we restrict to the 
case where for x € D° with D° denoting the complement of D we have 
min{TE (£), T, (£)} — œ. Now, the following is satisfied: The metastable 
transition times T from B® to BO) will depend on the starting point 
Xo = x°(0) and we write Tis [£o]. For (x*(0), y*(0)) € B® with x*(0) = zo 
we asymptotically obtain 


Ti2lro] = Exo[To-up.(2*(t))], (B.8) 


where Tpeup2(x*(t)) denotes first exit time of the process x*(t) from the set 
D° U D?. Instead of considering the exit times of the process z€(t), we can 
equally well consider the exit times of the conditionally averaged dynam- 
ics (1.7) with I(t,x) = 1 fixed. In the limit of s > 0 we will be allowed 
to replace xf in (B.8) by the small noise approximation or the OU averaged 
dynamics as defined in Sect. 4 and still obtain the correct asymptotics. And, 
conclusively, by using (5.2), we arrive for vanishing ¢ at 


Ti olto] = Eno[tpeup,(2°(t))] = ord(1) 
where x°(t) is determined by 
i = —D,V(2,m™) + oW. 


In exact the same way we obtain asymptotics for Tosi 
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Fig. B.2. left: Illustration of V2, i = 1,2 and V£! and the resulting subsets 
Dı, D2, D°; middle: transition probabilities (time step dt = 0.01) corresponding 
to (B.1) with vam! < v% (m) and e = 107%; right: transition probabilities with 
e= 10m2 


Proof. A careful inspection of the transition probabilities p{,(t, 2) as defined 
in (6.2) and (6.3) with gf;(x) = 1/7;_,, reveals for each time step dt pointwise 
convergence for almost every x 


lim pia(dt, x) =0, rE D,UD*, lim pj (dt, x) =1, re Dy. 


This shows that for e small enough, the particle in B® will instantly jump 
over the barrier once it has reached Dı and as long as it stays in Də U D° 
nothing will happen. 


Example B.5. Let vie be given as illustrated in the left picture® of Fig. B.2, 
that is, via is strictly monotonically decreasing with vio (x) — 0 as x > œo, 
and vy is strictly monotonically increasing with vy (x) > 0 as z > —o0. 
: ; . . 1 2 
Then there exists an intersection point m such that ve? (m) = v2) (m). Now, 
choose Vs"" such that Vm" < VČ (m) and define the relation between e 
and ç according to (B.1). The resulting subsets Dı, D2 and D° = (D, U 
Dz)° are shown in Fig. B.2. The picture in the middle shows the transition 
probabilities pj.2 = p{o(dt,x), po.1 = p§,(dt,x) to jump over the barrier 
for moderately chosen e = 10~° and time step dt = 1/100. At the right we 
illustrate the transition probabilities for very small € = 10~!%. We clearly 
observe that for vanishing e€ the particle will jump over the barrier once it has 
reached Dı and Dz, respectively. 


Example B.6. Let the assumptions be given as in Example B.5, but this time 
we choose Vsmell > vE (m). In this case, D° = Ø and the state space is 
decomposed into the sets Dı and Də that are separated by the point m with 
vi (m) = 0) (m). An illustration is given in Fig. B.3. Again, the transition 
probabilities for e = 107!” at the right-hand side reveal 


pı=2 = pioldt, z) ~ 0 forz € Do, posi =pio(dt,x) ~ 1 for re Dj. 


8 We have chosen the potential from Sect. 6. 
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Fig. B.3. Same as Fig. B.2, but this time VEe! > vy% (m). 
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B. Nestler, F. Wendler: 
Simulations of Complex Microstructure Formations. 


| ee a c) 


Fig. 1.1. Experimental micrographs of Al — Si alloy samples: a) Grain structure 
with different crystal orientations, b) network of primary Al dendrites and c) inter- 
dendritic eutectic microstructure of two distinguished solid phases in the regions be- 
tween the primary phase dendrites. (This figure is displayed in the text on page 114.) 
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Schematical Drawing Grain Structure Multiphase Solidification 





Fig. 2.1. Left image: Schematic drawing of a domain separation by four different 
phase regions; Middle image: Polycrystalline grain structure in Al-Si; Right image: 
Multiphase solification microstructure with dendrites and a eutectic structure. (This 
figure is displayed in the text on page 117.) 
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Fig. 2.2. 3D surface plot of a) a smooth and b) a facetted cubic anisotropy. c) 
contour plots of two adjacent growing, 45° misoriented cubic crystals applying the 
smooth anisotropy formulation in Eq. (2.5) with 6 = 0.2. (This figure is displayed 
in the text on page 120.) 
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Fig. 2.3. Plot of the multi-well potential ws:(y) for N = 3 and equal surface entropy 
densities yag. (This figure is displayed in the text on page 121.) 





Fig. 2.4. Plot of the multi-obstacle potential w.,(y) for N = 3 and equal surface 
entropy densities yag. (This figure is displayed in the text on page 122.) 
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Fig. 4.5. 3D Ni dendritic growth at an undercooling of A = 0.6. a) single chan- 
nel dendrite, b) dendritic array with an orientation inclination of 15° with rsp. to 
normal, c) equiaxial dendrite. (This figure is displayed in the text on page 135.) 


Fig. 4.8. Establishment of regular lamellar solidification at the eutectic composition 
in 2D (a) and 3D (b). (This figure is displayed in the text on page 138.) 
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Fig. 4.9. Regular oscillations along the solid-solid interface driven by the motion of 
the triple junction/triple line in 2D. (This figure is displayed in the text on page 138.) 
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Fig. 4.10. Topological change of the microstructure due to oscillations along the 
solid-solid interface in 3D. (This figure is displayed in the text on page 139.) 





Fig. 4.12. Formation of a 3D hexagonal rod-like structure in a ternary eutectic 
system with isotropic surface energies and three different solid phases a, and y. 
(This figure is displayed in the text on page 139.) 





Fig. 4.15. Selection process in a polycrystalline dendritic front: The colours in a) 
and b) indicate the orientations of the dendrites for two different time steps, whereas 
in c) the Ni concentration is shown. (This figure is displayed in the text on page 141.) 
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Fig. 4.16. Three time steps of two misaligned NiCu grains in 3D starting from 
a rough planar initial state (lateral periodic boundary conditions). (This figure is 
displayed in the text on page 142.) 





Fig. 4.17. Growth of dendritic NiCu grains into a 20 K undercooled melt illustrated 
by the Ni concentration (range: 0.41—0.62). The complete solidification (right image) 
is reached after further reducing the temperature by 15 K in a second step. (This 
figure is displayed in the text on page 143.) 
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Fig. 4.18. a) and b): Growth of a polycrystalline NiCu structure with 30 grains on 
a domain of size 100 x 100 x 100. The isosurfaces of selected grains for two time 
steps are displayed. c): Heat treatment with partial melting along the grain vertices 
(from [WN06]). (This figure is displayed in the text on page 143.) 





Fig. 4.21. Periodic test geometry with an inserted melt inclusion (light grey) at the 


horizontal boundary, shown for three time steps (from left to right). (This figure is 
displayed in the text on page 146.) 





Fig. 4.22. Coarsening of a grain structure with fluid inclusions (light grey) at the 
triple junctions for three time steps and for a ratio of the surface energies Yss/7/s1 = 
1.2. (This figure is displayed in the text on page 146.) 
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Fig. 4.24. Polycrystalline grain growth in a crack-seal process with facetted 
anisotropy of the surface energies of the grain boundaries. a) and b) show three 
time steps of two simulation runs with different starting grain distribution. (This 
figure is displayed in the text on page 148.) 





Fig. 4.25. Effect of different shear rates on the resulting morpholgy. From left to 
right: no shear, Ax = 10 cells, Ax = 20 cells. Two simulation runs a) and b) with 
different grain distributions are displayed. (This figure is displayed in the text on 
page 149.) 
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Garcke, Lenz, Niethammer, Rumpf, Weikard: 
Multiple Scales in Phase Separating Systems 
with Elastic Misfit. 





Fig. 1.1. Evolution starting from a perturbation of a uniform state. (This figure is 
displayed in the text on page 154.) 


Fig. 1.2. Alignment of interfaces driven by homogeneous, anisotropic elasticity. 
(This figure is displayed in the text on page 154.) 












Fig. 4.2. Effects of inhomogeneous elasticity: On the left side the green phase is 
the elastically harder one, the blue phase is softer. On the right side it is vice versa. 
The volume fraction of both phases are the same. (This figure is displayed in the 
text on page 166.) 
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10.175 


Fig. 5.1. Graph of the energy at an early and a very late stage of the evolution 
(two graphs on the left side), different time steps of the evolution (on the right side). 
(This figure is displayed in the text on page 167.) 
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Fig. 5.2. Graph of the nonelastic part of the energy. (This figure is displayed in the 
text on page 168.) 
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Fig. 7.4. The trace of the elastic strain (both top rows, compared to zero strain and 
eigenstrain), the trace of the stress (middle) and energy density (bottom) for the 
Mullins-Sekerka (left) and the reduced model (right), in the initial configuration. 
(This figure is displayed in the text on page 175.) 
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Bartels, Carstensen, Conti, Hackl, Hoppe, Orlando: 
Relaxation and the Computation of Effective Energies 
and Microstructures in Solid Mechanics. 


Fig. 5.1. Single-slip plasticity (a) First order laminates and (b) Second order lami- 
nates as from (3.12) and (3.16) respectively, assuming periodic boundary conditions. 
(This figure is displayed in the text on page 218.) 


S. Conti, G. Dolzmann: 
Derivation of Elastic Theories for Thin Sheets and 
the constraint of incompressibility. 
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(b) 

Fig. 4.2. (a) Modification of the map in Figure 4.1 in order to remove the singularity. 
(b) Plot of one half of surface in (a). This view illustrates that the modification is a 
smooth surface in R? for which the tangent plane in each point has full rank. (This 
figure is displayed in the text on page 235.) 
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F. Bornemann, C. Lasser, T. Swart: 
Energy Level Crossings in Molecular Dynamics. 





Fig. 1.1. The energy levels of the model for the cis-trans isomerization of retinal in 


3a 
2 


rhodopsin of [HS00]. They cross, when the angular variable ¢ is approximately 5 or 
and the collective coordinate vanishes. In the plot, the abscissa corresponds to 


ġ, the ordinate to y. The two local minima of the lower energy level are associated 
with the cis and the trans configuration of the molecule. (This figure is displayed in 


the text on page 579.) 
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Fig. 3.1. Resonances of the model operator H = —e?A, + V (q). The parameter 
k lies in {11, 12,...,60}, while v is chosen in {1.5,2.5,...,5.5}. The semiclassical 
parameter e varies from 107° to 1. (This figure is displayed in the text on page 591.) 
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Dellnitz, Hessel-von Molo, Metzner, Preis, Schiitte: 
Graph Algorithms for Dynamical Systems. 
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Fig. 3.2. Partition of graph describing the dynamics Pentane300. Left: The graph 
with all edges. Center: Partition consisting of five parts with Cint = 0.980. Right: 
Partition consisting of seven parts with Cint = 0.963. (This figure is displayed in the 
text on page 629.) 
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Fig. 3.3. Congestion of the Pentane. Left: all transitions. Center: only transitions 
with a low congestion. Right: only transitions with a high congestion. (This figure 
is displayed in the text on page 632.) 
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Cint = .987 
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Fig. 3.4. Recursive bisection of the graph for the pentane example. The values 
indicate: number of boxes / invariant measure / internal cost Cine / congestion of 
subgraph. The graph has 255 vertices and a congestion of 139.67. (This figure is 


displayed in the text on page 635.) 
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Fig. 4.5. Left column: Family of most probable paths between the set A (left min- 
imum) and the set B (right minimum). From the top to the bottom we choose 
€ = 0.1,€ = 0.3 and € = 0.6. Right column: Family of paths which crosses the lowest 
free energy barriers. From the top to the bottom we choose e = 0.01, € = 0.05 and 
e = 0.13. (This figure is displayed in the text on page 643.) 


